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coefficient of recombinant protein by modifying the product 
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It is known to enhance the partitioning ability of cells and 
proteins by fusing a protein of interest to a targeting 
protein. The known proteins contain tryptophan residues . In 
the description of the present application p 6, lines 27-31 it 
is stated that the targeting proteins of the invention are/can 
be larger than in the known small peptide tags of 12 amino 
acids. It is also stated the claimed proteins not necessarily 
contain tryptophan residues, but that they may do so. The 
scope of claim 1 covers all amphipathic ' and hydrophobic 
proteins that can be fused to a protein of interest to improve 
the ability to partition in ATPS, irrespective of if the 
contain tryptophan residues or not. The proteins in document B 
are covered by claim 1 in the present application . 

Document C discloses a method for immobilising a binding 
protein capable of binding to a specific compound by linking 
it to the outside of a host cell . This protein is useful in 
protein isolation processes. 

Documents D-E provide recombinant DNA sequences encoding for 
hfbl and hfb2 and their ability to fuse antibodies. 
"Documents C-E show the general state of the art. 

Thus , claim 1 is not considered to fulfil the requirements of 
novelty and inventive step. 

Claims 2-24 are considered to fulfil the requirements of 
novelty, inventive step and industrial applicability. 
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PROCESS FOR PARTITIONING OF PROTEINS 
Field of the invention 

5 The present invention relates to isolation and purification of proteins in aqueous two-phase 
systems (ATPS). Specifically the invention provides processes for partitioning of molecules 
of interest in ATPS by fusing said molecules to targeting proteins which have the ability of 
carrying said molecule into one of the phases. 

10 Background of the invention 

Liquid-liquid extraction in an aqueous two-phase system (ATPS) can offer a powerful 
technique for isolation and purification of proteins. The separation of macromolecules and 
particles by means of liquid-liquid extraction is well known (Albertsson, 1 986; Walter et al. , 
15 1985; Kula, 1990). Mainly polyethylene glycol (PEG) - salt, PEG-dextran and PEG-starch 
systems have been in use. More recently detergents and detergents with reversed solubility 
were discovered as suitable methods for separation of macromolecules, and especially for 
the separation of proteins. 

20 An advantage of aqueous two-phase systems (ATPS) is that they are especially suited for 
large scale processing of microbial proteins not only from culture supernatants but also from 
crude extracts containing cells and eel! debris (Kula, 1979; Kula, 1985). Characteristic 
features of biological fluids as well as suspensions are rather small particle sizes, low density 
differences between fluid and suspended solids, high viscosities of the extracts and high 

25 compressibility of the solids (Hustedt et al., 1985; Bender and Koglin, 1986). These 
attributes decrease the performance of conventional methods for solid-liquid separation like 
centrifugation and filtration at the beginning of a protein recovery process. Using an 
aqueous two-phase system removal of solids can be integrated into a liquid-liquid separation 
step, clarification is thus combined with an initial purification (Kula, 1979; Kuia, 1985). 

30 

After the extraction process phase separation can be accomplished by settling under gravity 
as well as by centrifugation (Kula, 1985). ATPS can be applied in various scales from very 
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small laboratory scale to large industrial scale thus suiting for various proteins, purposes and 
needs. With regard to industrial purposes commercially available centrifugal separators can 
be used to shorten separation time. Several authors have investigated the potential of 
centrifugal separators of various design for processing of large volumes of aqueous two 
5 phase systems (Kula, 1979; Kula et ai, 1981, Kula et al., 1982; Kula, 1985). In these 
studies the authors have used polymer/polymer or polymer/salt systems and the results of 
these investigations demonstrate the feasibility of continuous separation of aqueous two- 
phase systems in centrifugal separators. 

Extraction systems based on nonionic surfactants have been described as an alternative to 
standard polymer/polymer or polymer/salt systems. Phase forming surfactants are e.g! 
polyoxyethylene type nonionic detergents. The basis of this type of aqueous two-phase 
system is the temperature-dependent reversible hydration of the polar ethylene oxide head 
groups. The temperature at which the phase separation occurs is referred to as the cloud- 
point (cloud-point extraction). This kind of aqueous two-phase system is especially suited 
for the extraction of amphiphilic biomolecules. The potential of this type of two-phase 
system for separating membrane bound proteins from cytosolic and peripheral membrane 
proteins was first demonstrated by Bordier (1 98 1 ). Heusch and Kopp (1 988) have been able 
to demonstrate that lamellar structures formed in the miscibility gaps of polyglycol ether / 
water systems are responsible for the selective extraction of hydrophobic substances. 

Recently, the successful application of a surfactant-based aqueous two-phase system for the 
extraction of a membrane bound protein (cholesterol oxidase) from the unclarified culture 
medium of the gram-positive microorganism Nocardia rhodochrous on a bench scale has 
25 been reported (Minuthe* ai, 1995). By addition of only one chemical compound a product 
release through solubilization was possible in homogeneous phase and in a second step a 
clarification as well as an initial purification was achieved by an extraction process at 
elevated temperatures separating the detergent rich phase. A closed concept was further 
developed for the production of the membrane bound enzyme by surfactant-based 
30 extraction, organic solvent extraction and anion-exchange chromatography, which gave a 
product suitable for analytical applications (Minuth et a!., 1996). 
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In aqueous two phase systems the desired target e.g. a protein should partition selectively 
into one phase (preferentially the lighter phase) while the other substances should partition 
into the other phase (preferentially the heavier phase). In PEG/salt and PEG/dextran and 
similar systems there are several driving forces for a substance like charges, hydrophobic, 
5 hydrophilic forces or the dependence on conformation or ligand interaction (Albertsson, 
1986). The forces leading to separation in detergent based aqueous two phase systems are 
suggested to be primarily hydrophobic (Terstappen et al. y 1993). Even if a lot of work has 
been carried out in the field of prediction in ATPS, none of the designed models provides 
a physical picture of the phase behaviour and prediction is hardly possible (Johansson et al. , 



In ATPS the partitioning coefficient is defined as the concentration (activity in case of an 
enzyme) of the target in the top phase divided by the concentration (enzyme:activity) of the 
target protein in the bottom phase. Partitioning coefficients in ATPS systems are usually in 



15 the range from less than 1 up to less than 1 00 (Terstappen et al. , 1 992; Terstappen et al. , 
1993). 



Yield: is defined as the amount of target in the top phase divided by the sum of the amount 
of target in top and bottom. This leads to the following equation 
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1998). 




20 Y 7 = — = 
1 + 
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If the desired substance is directed to the heavier phase (as it can be the case using Triton) 
the yield is defined by 
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The volume ratio of the two coexisting phases are defined by the volumes of the lighter over 
the heavier phase, respectively. 

An example of useful proteins facing problems in purification in a cost-effective way are the 
commonly used industrial enzymes used as biocatalysts, the glycosyl hydrolases, proteases 
and lipases produced by fungi and bacteria. These are used in e.g. laundry, textile, paper and 
pulp, food and feed industry. The fact that microbes produce many different enzymes during 
their growth and the fact that some of these may be undesired in certain applications leads 
to a need to enrich the active component(s). This enrichment can be performed by choosing 
appropriate growth conditions, by genetic engineering and/or by down-stream processing 
(e.g. purification of the active component(s)). 

Purification of proteins are generally performed by chromatography. Usually gel- 
chromatographic methods are used based on ion-exchange, hydrophobic interaction, affinity 
chromatography and molecular sieving. Methods like electrophoresis and crystallisation can 
also be used. These methods are well known in the art and suitable for proteins of fairly high 
market value. In case of bulky enzyme production these methods, however, are too 
expensive in order to keep the final product on a compatible price level. Due to similar 
properties of these enzymes several purification steps are usually needed to separate the 
proteins from each other. This often causes low final yields and therefore a high loss of 
product. 

Many extracellular hydrolases produced by the filamentous fungus Trichoderma are 
currently used in different industrial applications in large scale. These hydrolases are e.g. 
hemicellulases (such as xylanases and mannanases), cellulases (such as endoglucanases and 
cellobiohydrolases) and proteases. Purification of these is well known in the art (Bhikhabhai 
et al. t 1984; Peree/a/., 1995), but for large industrial applications the purification methods 
are too expensive. Alternative methods to enrich these hydrolases have been used, including 
deletion of undesired genes by genetic engineering (Suominen et aL, 1 992). However, even 
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after extensive genetic engineering some minor undesired activities may still be present in 
the final product. 

ATPS have been studied in purification of cellulases of T. reesei and the purification of an 
5 endoglucanase III showed some promising results, enriching the yield of the protein in the 
upper phase (U.S. Pat. 5,139,943). ATPS have also been studied in purification of lipases, 
endoxylanase and natamycin (EP 0 574 050 Al). No K and Y values are, however, 
mentioned. 

As in other protein purification methods, similar properties of proteins produced by an 
organism impair also in ATPS, e.g. selective separation of one protein is not achieved 
optimally. To obtain selectivity in purification affinity chomatographic methods are used 
especially for analytic purposes and in purification of high- value products. These include 
immunoaffinity chromatography and various fusion protein strategies well described in the 
art such as fusing the protein of interest to an other protein (e.g. glutathione-S-transferase), 
protein domain (e.g. protein A-ZZ domain) or small peptide (e.g. His-tag), which selectively 
bind to the solid carrier and thus the recovery of the fusion partner is obtained as well. The 
fusion protein can be suitable for the particular purpose as such or cleavage of the product 
from the added fusion partner may be desired. There are well-known methods in the art on 
cleavage of fusion proteins from their partners by proteases, e.g. by factor X, thrombin or 
papain or by genetically introducing a protease cleavage site (e.g. Kex2 site) or autoproses- 
sing domains (e.g. Intein, New England Biolabs) or by chemical cleavage (e.g. CNBr). 

ATPS offer advantages mainly with respect to technology compared with the solid state 
25 based separation systems e.g. affinity column-based techniques. The scale-up of extractive 
enzymes is relatively simple utilising commercially available equipment and machinery 
common in the chemical industry. In addition, it can be used in a continuous process and it 
can be relatively cost extensive. It can be used as a single step for clarification, concentration 
and purification. ATPS can be used as a first capture-step, but for bulk products often no 
30 further purification is needed. 
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To aid selective separation in two-phase systems, recent publications have described the 
fusion of small peptide tags of 12 amino acids to the protein to be purified. The most 
successful of these soluble peptides are containing tryptophans. So far they have mainly been 
applied for very small molecules like the staphylococcal protein A derivative ZZTO 
5 (Berggrene/a/., 1999; Hassinen etal, 1994; Kohler etal, 1991). 

Use of ATPS has so far been limited to certain targets. Due to the advantages of ATPS in 
protein separation, purification and localisation, highly selective and powerful methods 
should be developed. This is especially important for large scale processes where ATPS in 
10 general is very inexpensive as a first capture step or as the only step for purification, 
clarification and concentration. The system should be universal so that the technique would 
be strong enough to mediate separation of in principle any component to the desired phase 
irrespective of its size or biochemical properties. 

15 Description of the invention 

In this invention we describe selective separation and partitioning of molecules and particles 
fusing them with targeting proteins having the capability to carry the molecule or particle 
of interest to the desired phase in ATPS, and to keep it in this phase if wanted. This 

20 invention is directed to make ATPS usable for every biotechnological product. By addition 
of the targeting protein to selective products, either by genetic tagging of proteins, by 
chemical binding, glueing or by use of any other technique, the product molecule can be 
turned more suitable for separation in ATPS. Using ATPS the product or certain component 
is therefore driven to one phase while the other components or by-products are directed to 

25 the other phase(s). 

We also describe that efficient separation in ATPS can be obtained using targeting proteins 
which are/can be larger than the described small soluble synthetic peptide tags of 1 2 amino 
acids or less. These targeting molecules can aid in separating of small molecules but even 
30 large proteins and particles. Unlike the small peptide tags, it is not necessary that they 
contain tryptophan residues, although they may do so. They can be hydrophobic or 
moderately hydrophobic and/or amphipathic in nature, either in monomeric form or when 
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forming aggregates. Such proteins can be found in nature or they can be designed, or 
obtained through for instance methods known in art for mutant generation, gene shuffling 
or directed evolution. Suitable targeting molecules can be screened for instance by fusion 
the product of interest to a library of natural or mutant sequences, and screening the ability 
5 of the fusion molecules to separate in ATPS. Furthermore, any molecule capable of 
separating in ATPS is a suitable targeting molecule. 

Several techniques exploiting purified protein for isolation of the corresponding gene may 
be used to find genes encoding suitable targeting molecules for ATPS. Suitable proteins or 

10 polypeptides may be purified on the basis of their properties. They can be obtained by 
applying the cells, eel! extracts or culture media to ATPS and recovering the proteins or 
peptides separated into the phase containing the hydrophobic phase material. Suitable 
targeting molecules may also be recovered for example from the culture medium foam 
formed either during the cultivation of a microorganism or caused by bubbling gas through 

15 the medium. Proteins and peptides suitable as targeting molecules may further be recovered 
from aggregates caused by freezing of culture media. After the targeting molecules have 
been purified, the corresponding genes are isolated using techniques known to a person 
skilled in the art. Such techniques include for example screening of expression libraries using 
antibodies raised against purified polypeptide or peptide, and PCR cloning and screening of 

20 genomic and/or cDNA libraries with oligonucleotides designed on the basis of N-terminal 
or internal protein sequences. 

Examples of molecules suited as targeting proteins in ATPS found in nature are 
hydrophobin-like small proteins. Hydrophobins are secreted proteins with interesting 

25 physico-chemical properties that have recently been discoverd from filamentous fungi 
(Wessels, 1994; Wosten and Wessels, 1 997; Kershaw and Talbot, 1998). They are usually 
small proteins, approximately 70 to 160 amino acids, containing eight cysteine residues in 
conserved pattern, and do usually not contain tryptophanes. However, also multimodular 
proteins with one or several hydrophobin domains and e.g. proline-rich orasparagine/glycine 

30 repeats, or hydrophobins containing less than eight cysteine residues have been characterized 
(Loraera/., 1994; Lora era/., 1995; Arntz and Tudzynski, 1997). Hydrophobins have been 
divided into two classes based on their hydropathy profiles (Wessels, 1994). 
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Today most protein data exists for the hydrophobic Sc3p of 'Schizophyllum commune (class 
I), and cerato-ulmin of Ophiostoma vlmi and cryparin of Cryponectria parasitica (class II), 
although more than 30 gene sequences for hydrophobins have been published (Wosten and 
5 Wessels, 1997). HFB genes are often naturally highly expressed but due to special 
requirements in cultivation conditions and the biochemical properties of the proteins, 
purification of HFBs in large amounts have been difficult. For instance only relatively 
moderate production levels of a few mg per liter of Sc3 hydrophobin in static cultures are 
obtained (Han Wosten, personal communication). Published purification procedures include 
10 e.g. multi-step extraction from fungal cell walls using organic solvents and bubbling or 
freezing of culture nitrates (Wessels, 1994). No reports of successful production of 
hydrophobins are available; levels of cerato-ulmin were no higher than those obtained with 
other naturally occuring fungal isolates (Temple et al. y 1 997). 

15 Upon shaking hydrophobin-containing solutions, the protein monomers form rodlet-like 
aggregates. These structures are similar to the ones found on surfaces of aerial stuctures. 
The self-assembly of purified Sc3 hydrophobin into a 10 nm thick amphipatic layer on 
hydrophilic and hydrophobic surfaces has been demonstrated (Wosten et al , 1 994a; Wosten 
et al. , 1 994b). This film is very strongly attached to the surface and not broken, for instance, 

20 by hot detergent. The hydrophobic side of the layer on hydrophilic surfaces shows properties 
similar to teflon (Wessels, 1 994). The Sc3 assemblages, as well as those of cerato-ulmin and 
cryparin, also form on gas-liquid or gas-air interphases thus stabilizing air bubbles or oil 
droplets in water. 

25 Surface activity of proteins is generally low but hydrophobins belong to surface-active 
molecules, their surfactant capacity being at least similar to traditional biosurfactants such 
as glycolipids, lipopetides/lipoproteins, phospholipids, neutral lipids and fatty acids (Wosten 
and Wessels, 1997). In fact Sc3 hydrophobin is the most potent biosurfactant known. It 
lowers the water surface tension to 24 mJm 2 at a concentration of 50 //g/ml due to a 

30 conformational change during self-assembly ofmonomers into an amphipathic film (Wosten 
and Wessels, 1997). 
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Hydrophobin-like molecules vary in their properties. For instance, rodlet-forming capacity 
has not been assigned for all hydrophobins (such as some class II), or they might have a 
weaker tendency to form stable aggregates (Russo et ah, 1992; Carpenter et ai., 1992). 
Another group of fungal amphiphatic proteins are repellents ( Wbsten et al. , 1 996 (Ustilago), 
5 for review, see Kershaw and Talbot, 1 998). Consequently, other type of proteins suited as 
targeting proteins for ATPS, may have only some of the features assigned to hydrophobins. 
Other suitable proteins are hydrophobic ones such as e.g. lipases, cholesterol oxidase, 
membrane proteins, small peptide drugs like nisin, aggregating cell wall proteins, lipopetides 
or any parts of these or combinations of these, and other molecules like glycolipids, 
10 phospholipids, neutral lipids, fatty acids in combination with proteins or peptides. 

In this invention the targeting protein, such as a hydrophobin-like protein or parts of it, is 
bound to the product molecule or the component to be separated. First, phase forming 
materials and eventually possibly also additional salts are added to a watery solution 

15 containing the fusion molecule or component, and optionally also the contaminating 
materials. The added agents are mixed to facilitate their solubilization. As soon as they are 
solubilized the two phases are formed either by gravity settling or centrifiigation. In the 
separation the target protein drives the product to for instance the detergent-rich phase 
which could either be the top or the bottom phase. The method is not only useful for 

20 purification of products of interest but also for keeping the product or the component of 
interest, such as a biocatalyst, in a particular phase which enables certain useful biotechnical 
reactions. 

Several ATPS systems are suitable for performing this invention. These include PEG 
25 containing systems, detergent based systems and novel thermoseparating polymers. 
Detergent based systems can be nonionic, zwitterionic, anionic or kationic. The system can 
be based on amphiphilic polymeric detergents, micelle forming polymers. Novel polymers 
can be based on polyethylene-polypropylene copolymers such as pluronic block copolymers, 
Brij, polyoxyethylene derivatives of partial ethers of fatty acids made by adding polyoxyet- 
30 hylene chains to the nonesterified body and polyoxyethylene derivatives. The well known 
PEG/salt, PEG/dextran and PEG/starch (or derivatives such as Reppal, hydroxipropyl 
starch) systems where PEG and water are forming the top phase and dextran/starch/salt and 
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water are forming the bottom phase. As salts are used phosphate, citrate, sulfate or others. 
In the present process the target is partitioning mainly to the top phase, while most of the 
contaminants are separating mainly to the bottom phase. Some hydrophobic contaminants 
might partition to the top phase as well. Using detergent based systems only one phase 
5 forming detergent has to be added. Optionally, salts and other chemicals can be used in 
addition. The mentioned chemicals are added, and the solution is mixed. After mixing the 
separation can take place either by centrifiigation or gravity settling. In order to separate 
into two phases the temperature of the solution has to be over the cloud-point of the 
detergent. The solution has to be heated if the cloud-point is not reached otherwise. If 
1 0 wanted, a second separation step can follow after a first extraction step and the product rich 
phase can be further purified. Also the remaining product in the product poor and by- 
product rich phase can be re-extracted. Very good K values can be obtained and the yields 
and concentration factors are high. 

15 The process of the present invention can advantageously be used in laboratory scale but is 
especially suitable for large scale separations. It can successfully be used in the separation 
of proteins and components from large fermentations. Using genetic modifications, the 
method can be used to purify any protein of interest including extracellular enzymes and 
proteins such as cellulases and hemicellulases from mixtures containing large amounts of 

20 protein such as several grams per liter. Furthermore, this separation can be obtained from 
various culture media including industrial media containing particular materials such as 
cellulose and spent grain. The method can be used to purify the product from culture media 
of strains modified not to produce endogenous hydrophobias. The separation can be done 
directly from the fermentation broth which can additionally contain cells, even viscous 

25 filamentous fungi. High biomass levels are acceptable for the process as explained in 
example 9. An example is the extracellular endoglucanase I from the fungus Trichoderma 
reesei which can be tagged for instance with the class 2 HFBI and can for example be 
separated with the nonionic polyoxyethylene C12-C18E05. In this example the detergent 
rich phase is the lighter phase and contains most of the tagged endoglucanase while most 

30 of the other cellulases, proteases and other enzymes remain in the heavier phase. The 
mycelium separates to the bottom phase, too. The separation can be achieved using 
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separation temperatures higher than 25 °C. The temperature can be decreased if certain salts 
like NaCl or K 2 S0 4 are added. 

The invention describes separation of molecules produced in various different organisms 
5 such as bacteria, yeast and filamentous fungi. The invention is suitable for purification of 
product molecules from extra- or intracellular locations, including cell wall bound molecules. 
It provides examples how the fusion molecule can be secreted by these different organisms 
but also provide an example how the fusion can be produced intracellularly. 

10 The invention further describes how fusion molecules consisting of several domains can be 
constructed and successfully expressed and produced. The invention describes fusions of the 
targeting molecule to a small protein (CBD), to a moderately sized protein (EGI) and to a 
huge highly glycosylated protein (Hoi), and different domain variations of these. These 
molecules can be ready as such for biotechnical use. Alternatively, the product can be 

15 cleaved from the targeting protein by any method known in the art such as with proteases 
e.g. thrombin, factor X, papain or by chemical cleavage. Furthermore, ATPS is a preferential 
means to be used to separate the product from the targeting protein after cleavage, or these 
can be separated with other methods known in the art. 

20 A suprising feature is that the targeting protein can also be used to carry large particles to 
the desired phase in ATPS. This can be obtained if the particles already contain proteins 
suited for targeting such as spores/conidia do in case of fungi. The targeting protein can also 
be attached to the particles or compounds in vitro. If cells are separated, the targeting 
protein can alternatively be expressed in the recombinant cells in such a way that it is 

25 exposed at the cell surface whereby it mediates the separation of the cells in ATPS. A 
teaching how this can be done is provided in example 22. Other types of molecules which 
direct the targeting molecule to the cell surface can be found e.g. in the literature including 
bacterial outer membrane proteins and lipoproteins (Stahl and Uhlen, 1 997), and yeast 
proteins a-agglutinin and flocculin (Schreuder etal, 1996; YMetal. (1994) WO 94/01 567, 

30 Frenken (1994) WO 94/18330). 
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A further advantage of the system is that the invention combined with ATPS provides a 
means to separate the product or desired component not only from other unnecessary or 
unwanted proteins but also from harmful proteins such as proteases as described in example 
6. Thus, the invention is particularly suited for production and purification of heterologous 
5 proteins, e.g. sensitive mammalian proteins usually produced in limited amounts in 
heterologous hosts. Such proteins are for instance antibodies or fragments thereof, 
interferon, interleukin, oxidative enzymes and any foreign protein which can otherwise be 
produced in a host. It is possible that separation of the product from e.g. culture medium 
can also be obtained on-line or semi-continuously, thus minimising the effect of proteases 
10 or other harmful components present in the culture. When produced intracellular^, the 
invention also provides means to separate the heterologous product, for instance the 
inclusion bodies it may form, from the cellular extracts. 

This invention describes for the first time that fusion proteins containing hydrophobin-like 
15 molecules can be made and produced in significant amounts despite the very particular 
properties of hydrophobin-like molecules. Importantly, this invention also describes how 
recombinant strains producing increased amounts of hydrophobin-like proteins as such can 
be made. This provides means to produce the targeting protein for uses in which it is wanted 
that the targeting protein is bound to the product or particle in vitro, to enable further 
20 separation of such molecules or particles in ATPS. 

Importantly, this invention also decsribes how hydrophobin-like molecules can be purified 
in ATPS very efficiently with high K-values. The molecules can be separated in the same 
way as the above mentioned fusions, for instance by PEG systems of by detergent-based 
25 systems. Separation can be done from the culture medium or from cells. This provides a 
significant improvement in making pure preparates containing hydrophobin-like molecules 
since due to their properties their purification is very complicated and results in losses with 
the previously reported techniques as described above. 

30 Brief description of drawings 



Fig. 1 



shows the map of the plasmid pMQ103. 



13 

shows the map of the plasmid pMQl 1 3 . 
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shows the map of the plasmid pMQl 04. 
shows the map of the plasmid pMQl 14. 
shows the map of the plasmid pMQ105. 
shows the map of the plasmid pMQl 15. 
shows the map of the plasmid pMQ 121. 

Coomassie-stained 10% SDS-PAGE of the partitioning of EGIcore-HFBI 
fusion protein in two-phase separation using 5 % of the detergent CI 2- 
C18E05. Lane 1, Molecular weight marker; Lane 2, Purified CBH3 (4 ug); 
Lane 3, Purified EGI (4 ug); Lane 4, 1/10 diluted VTT-D-98691 cellulose- 
based culture filtrate; Lanes 5 and 6, 1/10 diluted bottom phase and detergent 
phase (top phase), respectively, after separation of VTT-D-98691 culture 
filtrate with 5% detergent; Lane 7, Non-diluted bottom phase; Lane 8, Non- 
diluted VTT-D-98691 cellulose culture filtrate. 

Western analysis of the partitioning of EGIcore-HFBI fusion protein in two- 
phase separation by using different concentrations of the detergent C12- 
C18E05. Fusion proteins were detected with anti-HFBI antibodies. Lane 1, 
Molecular weight marker; Lane 2, Purified EGI; Lane 3, VTT-D-98691 
cellulose culture filtrate; Lanes 4 and 5 } Detergent phase (top phase) and 
bottom phase, respectively, after separation of VTT-D-98691 culture filtrate 
with 5% detergent; Lane 6, Same as lane 3 S except 2 % detergent was used; 
Lane 7, Same as lane 4, except 2 % detergent was used; Lane 8, Purified EGI; 
Lane 9:, Purified CBHI. 
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Fig. 10 Coomassie-stained 10% SDS-PAGE showing further purification of EGIcore- 
HFBI fusion protein from the endogenous CBHI when the top phase was re- 
extracted with 2% detergent. Lane 1, Molecular weight marker; Lane 2, 
Purified CBHI (4 ^g), Lane 3, Purified EGI (4 ng); Lane 4, Detergent phase 
5 (top phase) after first extraction; Lane 5, Detergent phase (top phase) after 

second extraction. 

Fig. 11 Coomassie-stained 10% SDS-PAGE analysis of the EGI-HFBI protein when 
treated with thrombin. Lane 1, Molecular weight marker; Lane 2, EGI-HFBI 
10 (1 mg/ml) treated 72 h with 3 U of thrombin at 24°C; Lane 3, Same as Jane 2, 

except no thrombin was added; Lane 4, EGI-HFBI (1 mg/ml) treated 48 h with 
9 U of thrombin at 36°C; Lane 5, Same as lane 4, except no thrombin was 
added; Lane 6, Same as lane 5, except no incubation at 36°C. 

15 Fig. 12 shows the map of the plasmid pTNS13. 

Fig. 13 Western analysis of the partitioning of dCBD-HFBI fusion protein in two-phase 
separation using 5 % of the detergent C12-C18E05. Fusion protein was 
detected with anti-HFBI antibody. Lane I, Four times concentrated culture 
20 filtrate; Lane 2, Four times concentrated bottom phase; Lane 3, Top phase. 

Fig. 14 shows the map of the plasmid pTNS15. A non-functional restriction site is 
indicated with an asterisk. 

25 Fig. 15 shows the map of the plasmid pTNSlS. 

Fig. 16 shows the map of the plasmid pTNS23. 



Fig. 17 
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shows that separation to the detergent phase of cells expressing a hydrophobia 
on cell surface occurs: the top detergent phase of S. cerevisiae VTT-C-993 1 5 
is turbid, whereas the top detergent phase of the control strain H2 155 is clear. 



10 



WO 00/58342 PCT/FI 00/00 249 

15 

Fig. 18 shows the map of the plasmid pTNS32. 

Fig. 19 shows the map of the plasmid pTH4. 

5 Fig. 20 shows the map of the plasmid pTNS30. 

Fig. 2 1 shows the map of the plasmid pTHl . 

Fig. 22 shows the map of the plasmid pTH2. 

Fig. 23 shows the map of the plasmid pKS2. 

Fig. 24 SDS gel showing the purification of HFBI-dCBD in a Berol 532 ATPS system. 



15 Fig. 25 Western analysis of the partitioning of HFBI-ENA5ScFv fusion protein in two- 
phase separation using 4% of the detergent C12-C1 8EOS. Lanes from left: 
(1) Molecular weight marker, (2) Top (enriched) phase of the strain VTT-D- 
00791, (3) Bottom (depleted) phase of the strain VTT-D-00791. 

20 Fig. 26 Western analysis of the partitioning of SC3 hydrophobin using SC3 specific 
antibodies. Lane 1, Bottom phase of the strain VTT-D-00792, Lane 2, Top 
phase of the strain VTT-D-00792. 



25 



Fig. 27 HPLC analysis for monitoring of the partitioning of HFBI in 2% Berol 532. 
Fig. 28 HPLC analysis for monitoring of the partitioning of HFBII in 2% Bero! 532. 
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The invention is further illustrated by the following Examples which describe construction 
of the fusion molecules of the invention, and partitioning of the molecules of interest using 
the process according to the invention. 
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EXAMPLES 
Example 1 

Construction of vectors for expression of EGI and EGIcore HFBI fusion proteins 
5 under the cbhl and gpdl promoters of THchoderma and gpdA promoter of Aspergillus 

For construction of an EGI-HFBI fusion protein, hfbl (SEQ ID 1) coding region (from Ser- 
23 to the STOP codon) and a peptide linker (Val Pro Arg Gly Ser Ser Ser Gly Thr Ala Pro 
Gly Gly) preceding it was amplified with PCR using pTNS9 as a template and as a 5' primer 

1 0 TCG GG CACTACG TG C CAG TAT AGC AAC GAC TAC TAC TCG CAA TGC 
CTTGTTCCG CGTGGC TCT AGT TCT GGA ACC GCA (SEQ ID 2) and as a 3 1 primer 
TCG TAC GGATCC TCA AGC ACC GAC GGC GGT (SEQ ID 3). pTNS9 has been 
decribed in detail in Example 19. The sequence in bold in the 5' primer encodes 16 C-terminal 
residues of EGI. The sequence in italics is a thrombin cleavage site and the underlined 

1 5 CACTACGTG is a DralU site. The underlined GGATCC in the 3' primer is a BamHI site. 
The 280 bp PCR fragment was purified from agarose gel and ligated to pGEM-T T/A vector 
(Promega) resulting in pMQ102. 

For construction of an EGIcore-HFBI fusion protein, the hfbl coding region (as above) was 
2 0 amplified with PCR using pTNS9 as a template and as a 5' primer ACT ACA CGG AG G 
AGC TC G ACG ACT TCG AGC AGC CCG AGC TGC ACG CAG AGC AAC GGC 
AAC GGC (SEQ ID 4) and as a 3' primer SEQ ID 3. The sequence in bold in the 5' primer 
encodes amino acids 410-425 in EGI and the underlined GAGCTC is a SacI site. The 260 bp 
PCR fragment was purified from agarose gel and ligated to pPCRII T/A vector (lnvitrogen) 
2 5 resulting in pMQ 111. 

In the next step Trichoderma expression vectors for production of EGI-HFBI and EGIcore- 
HFBI fusion proteins under the control of cbhl promoter and terminator sequences were 
constructed. The expression vector used as a backbone in the constructs is pPLE3 (Nakari ex 
30 al (1994) WO 94/04673) which contains a pUC18 backbone, and carries the cbhl promoter 
(SEQ ID 5) inserted at the EcoRI site. The cbhl promoter is operably linked to the full length 
egll cDNA (SEQ ID 6) coding sequence and to the cbhl transcriptional terminator (SEQ ID 
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7). The plasmid pMQl02 was digested with DralU and BamHI and the released 280 bp 
fragment containing hfbJ and linker sequences was purified from agarose gel and ligated to 
pPLE3 digested with DralU and BamHI. The plasmid pMQl 1 1 was digested with Sad and 
BamHI and the 260 bp fragment containing the hfbl sequence was ligated to pPLE3 digested 
5 with Sad and BamHI. The resulting plasmids pMQ103 (Figure 1) and pMQl 13 (Figure 2) 
carry the coding sequences for full-length EG! linked to HFBI via a peptide linker and for 
EGIcore linked to HFBI via its own linker region, respectively, under the control of cbhJ 
promoter and terminator sequences. 

1 0 Trichoderma expression vectors for production of EGI-HFBI and EGIcore-HFBI fusion 
proteins under the control of gpdl promoter and terminator sequences of Trichoderma and 
gpdA promoter and trpC terminator sequences of Aspergillus were constructed as follows. A 
SacII site was inserted inbetween the Xbal and Pad sites of pMV4 using as an adapter 
annealed primers TAA CCG CGG T (SEQ H> 8) and CTA GAC CGC GGT TAA T (SEQ 

15 ID 9). The resulting plasmid is pMVQ. pMV4 contains a pNEB193 (New England Biolabs) 
backbone, and carries a 1.2 kb Trichoderma gpdl promoter (SEQ ID 10) and a 1.1 kb gpdl 
terminator (SEQ ID 11) inserted at Sall-Xbal and BamHI-AscI sites, respectively. The 
expression cassettes for EGI-HFBI and EGIcore-HFBI were released from pMQ103 and 
pMQ113 with SacII and BamHI, purified from agarose gel and ligated to pMVQ cut with 

20 SacII and BamHI. The resulting plasmids pMQ104 (Figure 3) and pMQl 14 (Figure 4) carry 
the EGI-HFBI and EGIcore-HFBI cassettes, respectively, under the control of Trichoderma 
gpdl transcriptional control sequences. Expression plasmids pMQ105 (Figure 5) and 
pMQ115 (Figure 6) containing EGI-HFBI and EGIcore-HFBI cassettes, respectively, 
operably linked to the gpdA promoter and trpC terminator of Aspergillus were constructed. 

2 5 EGI-HFBI and EGIcore-HFBI cassettes were released from plasmids pMQ 1 04 and pMQ 1 1 4 
with Xbal and BamHI, blunted with T4 DNA polymerase and ligated to Ncol digesteded and 
T4DNA polymerase treated pAN52-l (SEQ ID 12). pAN52-l contains a pUC 18 backbone, 
and carries a 2.3 kb gpdA promoter and a 0.7 kb trpC terminator sequences of A. nidulans. 
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Example 2 

Construction of vectors for over-production of HFBI on cellulase-inducing and - 
repressing media 

5 For over-expression of HFBI under cbhl promoter the protein coding region of hfbl was 
amplified with PCR using as a template pEAlO (Nakari-Setala et al, 1996). pEAlO carries a 
5.8 kb genomic Sail fragment containing hfbl coding and flanking sequences. GTC AA C 
CGC GG A CTG CGC ATC ATG AAG TTC TTC GCC ATC (SEQ ID 13) was used as 
a 5* primer in the PCR and as a V primer SEQ ID 3. The sequence in bold in the 5' primer is 

10 21 bp of cbhl promoter adjacent to translation^ start site of the corresponding gene and the 
underlined CCGCGG is a Kspl site. The obtained fragment of 430 bp was digested with Kspl 
and BamHI and ligated to pMQ103 digested with Kspl and BamHI. The resulting plasmid 
pMQ121 (Figure 7) carries the coding sequence of hfbl operably linked to cbhl 
transcriptional control sequences. pEAlO plasmid is used for over-production of HFBI in 

1 5 cellulase-repressing conditions. 

Example 3 

Transformation of Trickoderma and purification of the EGI-HFBI and EGIcore-HFBI 
producing and HFBI over-producing clones 

20 

Trichoderma reesei strains QM9414 (VTT-D-74075) and Rut-C30 (VTT-D-86271) were 
co-transformed essentially as described (Penttila et al., 1987) using 3-13 ug of the plasmids 
pMQ103, pMQ113, pMQ104, pMQ114, pMQ105, pMQ115, pMQ121 and pEAlO and as 
the selection plasmids 1-3 ug pToC202, p3SR2 or pAR021. pToC202 (pUC19 backbone) 

2 5 and p3SR2 (pBR322 backbone) plasmids carry 2.7 kb Xbal and 5 kb EcoRI-Sall genomic 
fragments of A mdulans, respectively, containing the amdS gene (Hynes et al, 1983; Tiibum 
et al, 1983), pAR021 is essentially the same as pRLMex30 (Mach et al. 1994) and carries 
the E. coli hph gene operably linked to 730 bp of pkil promoter and 1 kb of cbh2 terminator 
sequences of T. reesei. The Amd+ and Hyg+ transformants obtained were streaked three 

30 times onto plates containing acetamide and hygromycin, respectively (Penttila et al, 1987). 
Thereafter spore suspensions were made from transformants grown on Potato Dextrose agar 
(Difco). 
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The production of the fusion proteins EGI-HFBI and EGIcore-HFBI and HFB1 was tested by 
slot blotting or Western analysis with EGI and HFBI specific antibodies from shake flask or 
microtiter plate cultivations carried out in minimal medium supplemented with either glucose, 
5 lactose or a mixture of Solka flock cellulose and/or spent grain and/or whey. The spore 
suspensions of the fusion protein producing clones were purified to single spore cultures on 
selection plates (containing either acetamide or hygromycin). To determine the best 
producers, production of the -fusion proteins was analyzed again from these purified clones as 
described above. 

10 

T. reesei strains selected for further fermentor cultivations are VTT-D-98692 (pEAlO), VTT- 
D-98492 (pMQ121), VTT-D-98693 (pMQ103), VTT-D-98691 (pMQ113), VTT-D-98681 
(pMQl05) and VTT-D-98682 (pMQ115). These strains have QM9414 as the host strain. 
VTT-D-99702 (pMQl 13) has Rut-C30 as the host strain. 

15 

Example 4 

Cultivation of the EGI-HFBI and EGIcore-HFBI protein producing and HFBI over- 
producing Trichoderma strains 

20 EGI-HFBI and EGIcore-HFBI fusions were produced under the cbhl promoter in a 15-litre 
fermenter using T. reesei strains VTT-D-98693 (pMQl03) and VTT-D-98691 (pMQ113), 
respectively. Strains were grown 5 days on minimal medium (Penttila et al, 1987) containing 
4% Solka flock cellulose (James River Corporation, Berlin, NH) and 2 % spent grain 
(Primalco, Koskenkorva, Finland). EGIcore-HFBI was also produced in fermenter (15 1) 

25 using the Rut-C30 strain VTT-D-99702 (pMQ113) with 4 % lactose medium. To induce the 
production of EGI-HFBI and EGIcore-HFBI fusions under Aspergillus gpdA promoter, T. 
reesei strains VTT-D-98681 (pMQ105) and VTT-D-98682 (pMQl 15) were cultivated in 1 5- 
litre fermenter. Strains were grown 3 to 5 days on minimal medium supplemented with 2% 
glucose, 0.2% Peptone, and 0.1% Yeast Extract, and with glucose feed to maintain the 

30 glucose concentration in the range of 1 to 3% throughout the cultivation. HFBI over- 
producing strain VTT-D-98692 (pEAlO) was grown similarly in 15 1 on glucose medium and 
the strain VTT-D-98492 (pQM121) over-producing HFBI under cbhl promoter was 
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cultivated for 7 days in 15-litre fermentor on medium containing 4% Solka flock and 2% 
spent grain. The control cultivations with the host strains of the transformants, QM9414 
(VTT-D-74075) and Rut-C30 (VTT-D-86271), were carried out on media containing i) 
Solka flock cellulose and either spent grain or whey, ii) lactose and iii) glucose similarly as 
5 described above. 

When proper some T. reesei transformant strains and their host strains were also cultivated at 
28°C in shake flasks for 5 to 6 days in 50 to 1 50 ml volume of Trichoderma minimal medium 
(Penttila ei al. , 1 987) suplemented with either 3 % Solka flock cellulose and 1 % spent grain or 
1 0 3-4% glucose with glucose feeding. 

Example 5 

Standard separation assays and analysis 

15 If not otherwise stated the standard ATPS and subsequent analyses and calculations were 
carried out as explained in this example. 

In general whole fermentation broth, supernatant (biomass separated by centrifugation or 
filtration) or purified proteins in buffer were separated in 10 ml graduated tubes. First 

2 0 detergent was added into the tubes and the tubes were then filled to 1 0 mg with protein 
containing liquid. The amount of detergent in the tube was calculated in weight percent of 
detergents. After thorough mixing in an overhead shaker the separation took place by either 
gravity settling in a water bath at constant temperature or by centrifugation at constant 
temperature. The separation usually was performed at 30°C, the standard amount of 

2 5 detergent used was 2-5% (w/v). After separation the volume ratio was noted and samples 
were taken from the lighter and heavier phase for analysis. 

Two-phase separations were analysed qualitatively by using SDS-PAGE gels followed by 
visualization of the fusion proteins with Coomassie brilliant blue R-250 (Sigma) or Western 
30 blotting. Polyclonal anti-HFBI antibody were used in Western analysis for detection of 
EGlcore-HFBI, EG1-HFBI and dCBD-HFBI proteins together with alkaline phosphatase 
conjugated anti-rabbit IgG (Bio-Rad). Alkaline phosphatase activity was detected 
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colorimetrically with BCIP (5-bromo-4-chloro-3-indolyl-phosphate) used in conjunction with 
NBT (nitro blue tetrazoJium) (Promega). 

Contaminating endogenous EGI, CBH3 and EGEQ in the top phase was tested with 
5 appropriate antibodies. Acidic protease activity in the top and bottom phase was also tested 
using the SAP method (Food Chemicals Codex, p. 496-497, 1981), which is based on the 30 
min enzymatic hydrolysis of a hemoglobin substrate. All reactions were performed at pH 4.7 
and 40 °C. Unhydrolyzed substrate was precipitated with 14% TCA and removed by 
filtration. The released tyrosine and tryptophan was determined spectrophotometrically. Total 
10 protein concentrations were determined by Non-Interfering Protein Assay™ (Geno 
Technology, Inc). 

EGI activity was detected using 4-methylumbelHferyl-B-D-celIobioside (MUC) (Sigma M 
6018) as substrate (Van Tilbeurgh H. & Caeyssens M., 1985; Van Tilbeurgh et.al, 1982). 

15 EGI hydrolyses the ft-glycosidic bond and fluorogenic 4-methylumbelliferone is released, 
which can be measured using a fiuorometer equipped with a 360 excitation filter and a 455 
nm emission filter. CBHI also hydrolyses the substrate and it was inhibited by addition of 
cellobiose (C-7252, Sigma). EGI containing liquid was added in an appropriate dilution to a 
buffer containing 50 mM sodium acetate buffer (pH 5), 0.6 mM MUC and 4.6 mM 

2 0 cellobiose. The mixture was heated to 50°C. The reaction was stopped after ten minutes 
using 2% NaaCOj, pH 10. Purified CBHI was detected using the same assay as for EGI 
without the addition of the inhibitor cellobiose. 

The partition coefficient K was defined as the ratio of the measured concentrations or 
2 5 activities in the top and bottom phase, respectively. 
The Yield Y was defined as follows: 



1+ 




I " 









3 0 where Yt is the Yield of the top phase, V B and V T are the volumes of top and bottom phase, 
respectively. The Yield of the bottom phase can be described accordingly. 
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The mass balances, e.g. recovery of all added protein, were always checked for completeness 
to ensure no artificially high Yield (e.g. due to possible inactivation of the protein in the 
bottom phase). The values were usually calculated based on total enzyme activity (EGI wt 
plus the EGI-fusion) and thus the values are underestimated for the separation of the fusion as 
5 demonstrated in Example 16. 

Example 6 

Small scale A TPS separation studies and gel analysis 

10 EGI-HFBI and EGIcore-HFBI fusions produced under the cbhl promoter in a 15-litre 
fermenter on Solka flock cellulose and spent grain medium as described in Example 4 using T. 
reesei strains VTT-D-98693 (pMQ103) and VTT-D-98691 (pMQ113), respectively, were 
separated in small scale ATPS as described above. 

1 5 The phases from the two phase separations were analysed qualitatively by using SDS-PAGE 
gels followed by visualization of the fusion proteins with Coomassie brilliant blue or Western 
blotting. Coomassie stained SDS-PAGE (10%) is shown in Figure 8. In the lane containing 
the non-extracted culture filtrate three distinct closely migrating bands can be seen (the 
sample was diluted 1/10 with H 2 0). The topmost band is CBHI, the band in the middle is 

2 0 EGIcore-HFBI fusion and the lower one endogenous EGI. In the samples separated in 
ATPS, only two bands (CBHI and EGI) are seen in the sample from bottom phase and one 
band representing EGIcore-HFBI in the sample obtained from the top phase. 

Western blotting with HFBI antibody showed thick bands for the top phase, whereas for the 

2 5 bottom phase there was only faint band demonstrating that the fusion is separating strongly 

into the detergent top phase. Figure 9 shows the separation of the EGIcore-HFBI fusion 
produced on cellulose media into the top phase. Contaminating endogenous EGI and EGIIl 
in the top phase was tested with appropriate antibodies but no signal was detected. 

3 0 Small amount of endogenous CBHI was found in the upper phase when CBHI antibody was 

used in Western blotting. EGI, EGO and proteases were not found in the top phase. Further 
purification from the contaminating CBHI was observed when the top phase was re-extracted 
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with 2 % detergent. The Figure 1 0 shows that the upper phase does not any more contain 
CBHI and pure fusion protein is recovered. 

EGIcore-HFBI was also produced in fermenter (15 1) using the Rut-C30 strain VTT-D- 
5 99702 (pMQH3) with 4 % lactose medium. The separation in ATPS carried out in the 
standard manner gave essentially the same result as the separation from cellulose containing 
medium thus demonstrating that the purification can be carried out from several media 
relevant for large scale industrial use. 

1 0 Acidic protease activity in the top was only 1/15 compared to the bottom phase (table below) 
demonstrating that acidic proteases remain in the bottom phase. 





A (275 run) 


HurVmi 


Bottom phase 1 


0.146 


41.6 


Top phase 2 


0.009 


2.6 



l l/10 diluted bottom phase after separation of VTT-D-98691 culture filtrate with 2% detergent 
1 5 2 1/100 diluted bottom phase after separation of VTT-D-98691 culture filtrate with 2% detergent 

3 1 HUT = enzyme concentration, which in reaction conditions hydrolyses hemoglobin in 1 min so that 
the absorbance at 275 nm of the formed hydrolysate equals 1 . 10 ug tyrosine/mi 0.006 N HC1 solution. 

These results show that the fusion protein can be purified extremely efficiently and the 
2 0 resulting preparate is free of other proteins produced by the fungus including proteases. 

Example 7 

Recovery of the native EGI in ATPS after thrombin cleavage 

25 EGI-HFBI protein produced by the strain VTT-D-98693 has a thrombin cleavage site 
(LVPRGS) designed in the linker region between the EGI CBD and HFBI, which would 
enable the recovery of the native EGI after thrombin cleavage. EGI-HFBI fusion protein was 
purified from the culture filtrate (100 ml) of strain VTT-D-98693 grown on 4% Solka flock 
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cellulose and 2 % spent grain as described in Example 4 using the 2-phase separation system 
(5 % detergent). After removal of the bottom phase the detergent phase was extracted by 
isobutanol. The resulting water phase (-19 ml) was divided in eppendorf tubes and the liquid 
was evaporated with speed vac. Remaining tyophilizate was diluted to 50 mM Tris-Cl (pH 8). 
5 To test the efficiency of thrombin cleavage, 9 units of thrombin (Sigma) was incubated > 24 h 
with 1 mg EGI-HFBI fusion protein in 36 °C at pH 8.0. Coomassie stained SDS-PAGE (10 
%) was used for detection. 

Only minor cleavage was observed in 48 h under these conditions (Figure 11), possibly due to 
1 0 steric hindrance by O-glycosylation in the linker. 

Example 8 

Separation of low concentrations of EGIcore-HFBI in ATPS 

15 Detergent based aqueous two-phase systems were successfully applied using very low 
concentrations (diluted) of EGIcore-HFBI fusion protein produced with the cbhl promoter in 
T. reesei VTT-D-98691 (pMQ 113) from a 15 liter cultivation carried out on Solka flock 
cellulose with spent grain as described in Example 4. 

2 0 The original protein concentration of the supernatant was 7.0 mg/ml. This supernatant was 
diluted with de-ionised water by a factor of 100 and 1000, respectively. The fusion protein 
could be separated using 2% (w/w) of the detergent C12-C18E05 with partitioning 
coefficients higher than 5. This is shown in the table below together with the experiment with 
non-diluted supernatant. The partitioning coefficients were calculated based on activity 

2 5 measurements for total EGI (wild type and fusion protein together). 





EGIcore-HFBI 
non-diluted supernatant 


dilution 1/100 


dilution 1/1000 


K 


4.1 


5.3 


5.6 


Y[%] 


38 


31 


32 
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Example 9 

Separation of EGIcore-HFBI from fungal biomass containing culture broths 

EGIcore-HFBI from T.reesei strain VTT-D-98691 (pMQ 113-2) was cultivated (50 ml in 
5 250 ml shake flasks) on Solka flock cellulose with spent grain as described in Example 4. 
Directly after the cultivation, part of the whole broth was centrifuged at 3000 rpm for 30 min, 
supernatant was spilled out and the centrifuged mycelium was added to the supernatant to 
obtain artificial whole broths containing different amounts of biomass. 

10 Using 5% of C12-C18E05 in a 10 g experiment consisting of up to 50% wet biomass 
(weight of wet biomass divided by the sum of wet biomass and supernatant) could still be 
separated without any difficulties. The Yield remained in between 61 and 64 % and therefore 
it is not significantly different in comparison to the experiment carried out with supernatant 
only (without mycelium) (see table below). The total recovery of the fusion protein is even 

15 higher. This is most probably due to cell attached enzyme extracted in the ATPS increasing 
the total amount of EGI. The partitioning coefficient was calculated based on activity 
measurements for total EGI (wild type and fusion protein together), 





K 


Y[%] 


Supernatant 


5.5 


62 


25% of wet biomass in 
supernatant 


7.3 


66 


40% of wet biomass in 
supernatant 


6.4 


61 


50% of wet biomass in 
supernatant 


7.6 


64 
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Example 10 

Separation of EGI-HFBI in ATPS 

EGI-HFBI from Trichoderma reesei strain VTT-D-98693 (pMQ 103) from a 15 liter 
5 cultivation carried out on Solka flock cellulose and spent grain as described in Example 4 was 
separated in a 10 g experiment using different amounts of C12-08E05. The partitioning 
coefficients are shown below. The partitioning coefficient was calculated based on activity 
measurements for total EG1 (wild type and fusion protein together), and as in previous 
examples the endogenous EGI is included in the partitioning coefficients. 



0 



Detergent [% w/w] 


2 


3 


5 


7 


K 


1.9 


1.8 


1.4 


1.1 



Example 11 

Separation of EGIcore-HFBI in 50 ml 

5 

EGIcore-HFBI from T. reesei strain VTT-D-98691 (pMQ 113) cultivated in 15 liters using 
Solka flock cellulose and spent grain as described in Example 4 was separated in Falcon tubes 
in a 50 g experiment using 5% of C12-C18E05. A partition coefficient of 2.52 and a yield of 
51 % could be obtained. The separation was performed at 30°C at 3000rpm for 30 minutes. 
0 The values are based on activity measurements for total EGI activity (wild type and fusion 
protein together) including endogenous EGI. 

Example 12 

Separation of EGIcore-HFBI in ATPS using different detergents 

5 

EGIcore-HFBI from T. reesei strain VTT-D-98691 (pMQ 113) cultivated in 15 liters using 
Solka flock cellulose with spent grain as described in Example 4 was separated in a 10 g 
experiment using 2% of detergent in each experiment. The detergents investigated in this 
example were C10 E05, C12 E05, C14 E06 (each Nikko Chemicals, Japan), C12-C18 E05 
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(„Agrimul NRE 1205", Henkel, Germany), C12/14 5EO, C12/14 6EO ( Clariant, Germany), 
C9/1I E05.5 („Berrol 266", Akzo Nobel, Germany), Triton X-l 14 (Sigma, Germany). The 
partition coefficients and yields are listed below. The values are based on activity 
measurements for total EGI activity (wild type and fusion protein together) including 
5 endogenous EGL 





K 


Y(fusion) [%] 


C10EO5 


20 


56 


C12E05 


15 


57 


C12-C18E05 


14 


66 


C12/14 5EO 


12 


58 


C12/14 6EO 


14 


62 


C14E06 


11 


54 


C9/1 1 E05.5 


5 


30 


Triton X-l 14 


0.16 


53 



1 0 Example 13 

Separation of EGIcore-HFBI in ATPS from glucose grown cultures 

EGIcore-HFBl was separated from cultivation of the Trichoderma reesei strain VTT-D- 
98682 (pMQ115) cultivated with glucose as described in Example 4. The supernatant was 
15 separated with 2% of the detergent C12-C18 E05. The fusion protein could be partitioned 
with a K value of 2.4. In comparison, the K value for the native EGI is 0.3 when measured in 
a similar way for purified EGI. 

Example 14 

2 0 Separation of EGlcore-HFBI using different concentrations of detergent 



EGIcore-HFBI from T. reesei VTT-D-98691 (pMQ 113) cultivated in 15 litres using Solka 
flock with spent grain as described in Example 4 was separated in detergent based ATPS 
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applying different amounts of the detergent C12-C18 E05 on the cell free supernatant. The 
partitioning coefficients are shown in the table below. The corresponding gel electrophoresis 
and Western antibody-blots are shown in Figure 8 and Figure 9, respectively. 
The values are based on activity measurements of total EGI activity. 



Amount of detergent C12-C18 E05 


K 


Yield(%) 


1.0% 


6.1 


9 


2.0% 


4.1 


38 


3.5% 


3.6 


50 


5.0% 


2.9 


55 


7.5% 


1.7 


53 


10.0% 


1.1 


58 



Example 15 

Re-extraction of the detergent phase 

10 

Detergent based ATPS was applied on EGIcore-HFBl fusion protein containing supernatant 
produced by the strain VTT-D-98691 (pMQ 113) in a shake flask cultivation. The first 
extraction using C12-C18E05 conducted under the standard conditions shows a partitioning 
coefficient of 16 and a yield of 72 % ( wild type EGI measured together with fusion protein). 

15 The top phase was re-extracted in 10 mM sodium acetate buffer (pH 5) with 2% of 
detergent. A partitioning coefficient of 52 and a yield of 89 % could be obtained. In the re- 
extraction experiment of the bottom phase (2% of detergent), a small yield of 7.5% and a K 
of 0.8 of EGI activity were achieved. The partitioning coefficients were calculated based on 
activity measurements for total EGI (wild type and fusion protein together). Due to the wild 

2 0 type EGI present in the sample, the yield is at least 72% and the partitioning coefficient at 
least 16 in the first extraction. The SDS-PAGE results of both extraxtions are shown in 
Figure 10. 
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Separation step 


K 


Y[%] 


2% detergent 


16 


72 


reextr action top phase 


52 


89 


reextraction bottom phase 


0.8 


7.5 



Example 16 

Separation of pure cellulases in ATPS 

5 

The effect of HFB1 on partitioning and the final yield can further be demonstrated by 
comparing the extraction result of EGIcore-HFB fusion with extraction results obtained with 
purified wild type EGI and EGIcore. The fusion protein is partitioning more than 100 times 
better to the detergent phase (see table belove). 

10 

The improvement on the partitioning of the purified fusion protein from the first extraction 
obtained in the re-extraction (see Example 15) can be explaned by the partitioning of the wild 
type EGI as demonstrated with purified wild type EGI in the table below. The wild type EGI 
lowers the partitioning coefficient in the first extraction (since EGI activity is measured from 
1 5 both top and botom phase), but the absence of it in the re-extraction increases the partitioning 
coefficient of the EGIcore-HFBI fusion. The purity can in addition be demonstrated by 
analysing the partitioning of pure CBHI, which is the major contaminating protein 
corresponding to about 50 % of all secreted T. reesei proteins. Pure CBHI has a partitioning 
coefficient of 0.5 and a yield of 3 .6 and is therefore separated from the fusion protein. 



Separation step 


K 


Y[%] 


re-extraction of top phase 


52 


89 


extraction of pure wild type EGI 


0.3 


2.2 


extraction of pure EGI-core 


0.3 


2.3 


extraction of pure CBHI 


0.5 


3.6 
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Using the definitions of K and Y and calculating mass balances, the ratio of the amount of 
EGI fusion protein to EGI wild type can be calculated. The "true" partition coefficients and 
Yields can be concluded from this. "True" means the values which would be detected if it 
would be possible to measure only the amount of EGI-fusion without measuring the amount 
5 for EGI wild type at the same time. 

The fundament for the calculation is the re-extraction experiment. The re-extracted top phase 
is believed to be pure. An example of the measured values and the calculated "true" values 
based on this are shown in the table below for two cultivations of VTT-D-98691 (pMQl 13) 
1 0 grown as described in Example 4. 



cultivation 
vessel 


cultivation 
substrate 


K "with EGI 
wt" 


"true" 
K 


Y [%] "with 
EGI 

wt" 


"true" . 
Y[%] 


15 liter 
fermenter 


whey permeate 


4 


6 


16 


54 


250 ml shake 
flask 


cellulose 


16 


54 


66 


90 



1 S Example 17 

HFBI and HFBII purification in ATPS 

HFBI was produced by cultivating the T. reesei strain VTT-D-98692 (pEA10-103B) using 
glucose as substrate as described in Example 4. HFBI could be separated using 2% of the 
2 0 detergent CI2-C18 E05 with a partition coefficient higher than 20 under the standard 
conditions described. 

HFBII was produced by cultivating the T. reesei strain VTT-D-74075 (QM9414) on whey 
spent grain as described in Example 4. HFBII could be separated using 2% of the detergent 
2 5 C12-C1 8 E05, exceeding a partition coefficient of 10 under the standard conditions. 



Both HFBI and HFBII hydrophobins are thus partitioning well to the upper phase in ATPS. 
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Example 18 

Detergent based ATPS with additional NaCl 

EGIcore-HFBI from which cultivation of T.reesei was separated in a 10 g experiment using 
5 5% of C12-C18E05. The partitioning coefficient of the supernatant was 3.5 with a volume 
ratio of 0.2. Using 1.1 % (w/V) NaCl the partitioning coefficient could be increased to 4.3 
with a lower volume ratio of 0. 14. 

Example 19 

10 Construction of an E coli strain expressing a fusion protein HFBI-dCBD, containing 
hydrophobin I and double cellulose binding (CBD) domains 

A 280 bp DNA fragment containing a modified cbh2 linker region followed by the coding 
region of hfbl from Ser-23 to the STOP codon was amplified by PCR using the plasmid 

15 pAROl (Nakari-Setala etal. 7 1996) as a template. The 5' primer was 5' TCT AGC AAG 
CTT GGC TCT AGT TCT GGA ACC GCA CCA GGC GGC AGC AAC GGC AAC 
GGC AAT GTT TGC (SEQ ID 14) and the 3' primer was 5' TCG TAC AAG CTT TCA 
AGC ACC GAC GGC GGT (SEQ ID 15). The sequences in bold in the 5' and 3' primers 
encode the modified CBrffl linker (Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly) and a 

2 0 translation^ STOP, respectively, and the underlined AAGCTT in both primers is a Hindlll 
site. The PCR fragment was purified from agarose gel, digested with Hindlll and ligated to 
Hindlll digested and SAP treated (Shrimp Alkaline Phosphatase, USB) pSP73 resulting in 
plasmid pTNS9. 

2 5 For subsequent cloning of the modified CBHII linker-HFBI fragment to an E. coli expression 
vector, pTNS9 was digested with HindHI and the proper fragment was purified from agarose 
gel. This Hindm fragment was cloned to Hindlll digested and SAP treated (Shrimp Alkaline 
Phosphatase, USB) B599 resulting in pTNS13 (Figure 12). The E. coli expression vector 
B599 is essentially the same as the one described by Linder et al. (1996) except that it is 

30 missing a STOP codon at the end of the protein coding sequence. It carries the coding 
sequence for a fusion protein containing CBHII CBD (41 N-terminal residues of CBHII) and 
CBHI CBD linked together via CBHI linker region (CBHI linker and CBD are the last 57 
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residues in CBHI). The expression and secretion of the fusion protein in B599 is under the 
cotrol of tac promoter and pelB signal sequence (Takkinen et al , ( 1 99 1 ). pTNS 1 3 expression 
vector thus carries the coding region for a fusion protein of double CBD and HFBI linked in 
frame via the Gly-Ser-Ser-Ser-Gly-Thr-Ala-Pro-Gly-Gly peptide. This vector also contains 
5 the amp gene for selection of E. coli transformants. pTNS 13 plasmid was transformed into E. 
coli strain RV308 (su-, AIacX74, ga/ISIl::OP308, strA) and this strain was used for 
production of the fusion protein. 

Example 20 

1 0 Separation of HFBI-dCDB molecules expressed in Rcoli in ATPS 

dCBD-HFBI was produced in E. coli strain RV 308 transformed with pTNS13 plasmid as 
described above. The inoculum of RV308/pTNS13 was grown to the exponential growth 
phase in LB medium containing ampicillin (0. 1 g/1) and 1% glucose. Fermentation was carried 

15 out using mineral salt medium described by Pack et al. (1993) with glucose (feed) in 10 litre 
fermenter. During cultivation temperature was maintained at 28 °C and pH was controlled at 
6.8 with NH4OH. Cell growth was monitored by measuring ODgoo and dry weight of 
biomass. The culture was induced with 50 uM (final concentration) IPTG (isopropyl-p-D- 
thiogalactopyranoside) at late-exponential growth phase (00600=50-60) to promote fusion 

2 0 protein production. 

Two-phase separation analysis of dCBD-HFBI protein was performed using culture filtrate 
and 5% detergent in the total volume of 40 ml. Results from Western blotting showed that 2- 
phase separation with 5 % detergent in the standard way was highly specific also for the 
25 dCBD-HFBI fusion. Strong signal was observed in the sample from the detergent phase 
compared to the sample from the bottom phase as shown in Figure 13. 
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Example 21 

Construction of yeast strains expressing HFBI-Flol fusion protein on the celi surface 



For construction of a HFBI-FLOl fusion protein expression cassette, hfbj (SEQ ID 1) 
5 coding region (from Ser-23 to the STOP codon) was amplified with PCR using pAROl 
(Nakari-Setala et ai, 1996) as a template and as a 5' primer TCT AGC TCT AG A AGC 
AAC GGC AAC GGC AAT GTT (SEQ ID 16) and as a 3' primer TGC TAG TCG ACC 
TGC TAG C AG CAC CGA CGG CGG TCT G (SEQ ID 17). The underlined sequences in 
the 5' and 3' primers are Xbal and Nhel sites, respectively. The 0.225 bp PCR fragment was 

10 purified from agarose gel and ligated to pGEM-T vector (Promega) resulting in pTNSlO. 
The hjbl fragment was released from pTNSlO with Xbal and Nhel and ligated to pTNS15 
cut with the same restriction enzymes. Plasmid pTNS 1 5 (Figure 14) is essentially the same as 
pJasmid pBR-ADH 1 -FLO 1 L by Watari et al 1994 except that a Nhel site in the pBR322 
backbone has been replaced by a BglJI site and a unique Xbal site is introduced by linker 

1 5 cloning in the unique AocI site preceding the putative signal sequence cleavage site. The 
resulting plasmid pTNSIS (Figure 15) contains the complete expression cassette for HFBI- 
FLOl fusion protein in which HFBI substitutes the putative lectin domain from Ser-26 to 
Ser-3 1 9 in the yeast fiocculin FLO 1 (SEQ ID 1 8). 



2 0 In the next step, yeast expression vector for production of HFBI-FLOl fusion protein was 
constructed. The expression vector used as a backbone in the construct is pYES2 
(Invitrogen) (SEQ ID 19) which is a high-copy episomal vector designed for inducible 
expression of recombinant proteins in S. cerevisiae. It carries GAL J promoter and CYCl 
terminator sequences which regulate transcription, and 2u, origin of replication and URA3 

2 5 gene for maintenance and selection in the host strain. The plasmid pTNSl 8 was digested with 

HindlH and the released 3.95 kb fragment containing the expression cassette for HFBI-FLO 1 
was purified from agarose gel and ligated to pYES2 digested with Hindm This ligation 
mixture was concentrated by standard ethanol precipitation. The ligation mixture should 
contain besides unligated fragments and uncorrect ligation products also molecules where the 

3 0 vector and insert are correctly ligated with each other to result in plasmid pTOS23 (Fig. 16) 

which carries the expression cassette for HFBI-FLOl operably linked to GAL J and CYC J 
terminator sequences. 
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The above ligation mixture was transformed using the Li Ac method of Gietz et a!. (1992) 
into a laboratory S. cerevisiae strain H452 (wild type W303-1A; Thomas and Rothsteiru 
1989). Transforrnant colonies able to grow on SC-URA plates were picked and streaked on 
5 selective plates. Nitrocellulose replicas were taken from the plates and treated for colony 
hybridization according to Sherman et al (1983). To find those yeast colonies containing the 
pTNS23 plasmid, replicas were hybridized with digoxigenin labelled hfbl coding fragment 
after which an immunological detection was performed all according to the manufacturer 
(Boehringer Mannheim). Plasmids were recovered from several yeast colonies giving positivie 
10 hybridization signal by isolating total DNA and using this in electroporation of E. coli. 
Restriction mapping and sequencing were carried out to confirm that the pTNS23 plasmid in 
the yeast transformants was correct. One of the transformants carrying plasmid pTNS23 was 
chosen for further studies and was designated VTT-C-993 1 5. The control strain for it is yeast 
strain H2155 which carries the plasmid pYES2 in H452 background. 

15 

Example 22 

Separation of yeast cells expressing HFBI-Flol fusion protein in AITS 

The Saccharomyces cerevisiae strain VTT-C-993 15 (vector pTNS23) and its control strain 
20 H2155 (vector pYES2) were cultivated on synthetic complete medium lacking uracil (SC- 
URA) (Sherman, 1991) with 2% galactose as the carbon source to give an A$oo of 
approximately 4. Approximately 6.3 x 10 8 cells in their culture medium were taken to ATPS 
using 7% (w/v) C12-18E05 detergent (Agrimul NRE from Henkel) in a total volume of 5 ml. 
ATPS was carried out using strandard protocol. After phase separation by gravity settling, 
2 5 the top detergent phase was clearly turbid in the case of the strain VTT-C-993 1 5 in contrast 
to the control strain whose detergent phase was clear (Figure 17). Samples were taken from 
the top phases and dilution series from 10" 1 to 1 0' 5 were prepared in 0.9% NaCl and plated on 
YPD plates. After incubation at 30°C the amount of yeast colonies were calculated showing 
at least 70 times more yeast colonies of the strain VTT-C-993 18 on YPD plates compared to 
30 the control strain. This clearly manifests that also in a system over-loaded with cells, 
separation to the detergent phase of cells expressing a hydrophobin on cell surface occurs. 
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Example 23 

Partitioning of EGIcore-HFBI fusion protein in hydrophobin- containing and non- 
containing pure systems 

5 The effect of free HFBI and HFBII hydrophobic on the partitioning of EGIcore-HFBI 
was investigated by comparing how efficiently a sample of purified EGIcore-HFBI in 50 
mM acetate buffer pH 5.0 was extracted to the detergent phase in the presence and 
absence of purified HFBI and HFBII. The extraction was followed by measuring the 
depletion of hydrolytic activity on a soluble substrate such as methylumbelliferyl 
10 cellobioside in the aqueous phase before and after extraction. When 0.7 g/1 of either 
purified hydrophobin was present in the separation of 0.02 g/1 of EGIcore-HFBI with 2% 
C12-18E05 at 30°C the extraction was affected in the following way: 93% of the 
protein was extracted when additional hydrophobin was not present, but with HFBII 
present 82% was extracted and with HFBI present 88% was extracted. 

15 

Example 24 

Production of EGIcore-HFBI fusion proteins in T. reesei Ahfb2 strain for improved 
partitioning of the fusion protein in ATPS 

2 0 Trichoderma reesei strain QM9414 Ahfb2 (VTT-D-99726) was transformed essentially as 
described (Penttila et al t 1987) using 10 ug of the plasmid pMQl 13 (described in Example 
1 ) together with 3 ug of the selection plasmid pTOC202 containing the amdS gene (Hynes et 
al, 1983); Tilburn et al, 1983) of Aspergillus nidulans encoding for acetamidase. pMQl 13 
contains an expression cassette for production of EGIcore-HFBI fusion protein under the 

2 5 control of cbhl promoter and terminator sequences. 

The Amd+ transformants obtained were streaked two times onto plates containing acetamide 
(Penttila et al, 1 987). Thereafter spore suspensions were made from transformants grown on 
Potato Dextrose agar (Difco). The production of the EGIcore-HFBI fusion protein was 

3 0 tested by slot blotting or Western analysis with EGI and HFBI specific antibodies from shake 

flask or microliter plate cultivations carried out in minimal medium supplemented with Solka 
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flock cellulose. The spore suspensions of the clones producing fusion protein were purified to 
single spore cultures on selection plates containing acetamide. To determine the best 
producers, production of the fusion protein was analyzed again from these purified clones as 
described above. 

5 

For partitioning experiments of the EGIcore-HFBI fusion protein in ATPS using the 
polyoxyethylene detergent C12-18E05 the best production strain obtained in this study and 
as control strains VTT-D-98691 (QM9414 strain producing EGIcore-HFBI) and VTT-D- 
74075 (QM9414) were cultivated in shake flasks on medium containing Solka flock cellulose 
10 as described in Example 4. 

Standard partitioning experiments as described in Example 5 are carried out with culture 
supernatants. After separation the volume ratio of the lighter and heavier phase is noted and 
the concentration factor for the fusion protein is calculated from it. Samples are also taken 
15 from the lighter and heavier phase and analysed with SDS-PAGE, Western blotting and 
activity measurements as described in Example 5. Partition coefficients (K) and yields (Y) are 
calculated as described in Example 5. 

Example 25 

2 0 Construction of vectors for expression of EGIcore-HFBII and EGIcore-SC3 fusion 

proteins under the cbhj promoter of T. reesei 

For construction of an expression vector for production of EGIcore-HFBII fusion protein, 
hfl>2 (SEQ ID 20 ) coding region (from Ala- 16 to the STOP codon) was amplified by PCR 
25 using phfb2 (Nakari-Setala et al 1997) as a template and as a 5' primer CGG AGGAGC 
TCG ACG ACT TCG AGC AGC CCG AGC TGC ACG CAG GCT GTC TGC CCT 
ACC GG (SEQ ID 21) and as a 3' primer TCA TTGGATCCT TAG AAG GTG CCG 
ATG GC (SEQ ID 22). The sequence is bold in the 5' primer encodes for amino acids 413- 
425 of EGI and the underlined GAGCTC is a Sad site. The underlined GGATCC in the 3' 

3 0 primer is a BamHI site. The amplified fragment was digested with Sad and BamHI and 

ligated to pMQl 13 cut with the same restriction enzymes. The resulting plasmid is pTNS32 
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(Figure 18) and it carries the coding sequence for EGIcore-HFBII fusion protein under the 
control of cbhl regulatory sequences (SEQ ID 5 and SEQ ID 7). 

For construction of an expression vector for production of EGIcore-SC3 fusion protein, SC3 
5 cDNA (SEQ ID 23) was ampified with PCR using cSC3/pUC20 plasmid as a template and as 
a 5* primer ACT ACA CGG AGGAGCTCG ACG ACT TCG AGC AGC CCG AGC 
TGC ACG CAG GGT GGC CAC CCG GGC (SEQ ID 24) and as a 3' primer TCG TAC 
GGATCC TCA GAG GAT GTT GAT GGG (SEQ ID 25). The sequence is bold in the 5' 
primer encodes for amino acids 413-425 of EGI and the underlined GAGCTC is a Sad site. 
1 0 The underlined GGATCC in the 3 1 primer is a BamHI site. The amplified fragment was 
digested with SacI and BamHI and ligated to pMQ103 (described in Example 1) cut with the 
same restriction enzymes. The resulting plasmid is pTH4 (Figure 19) and it carries the coding 
sequence for EGIcore-SC3 fusion protein under the control of cbhl regulatory sequences 
(SEQ ID 5 and SEQ ID 7). 

15 

The cSC3/pUC20 plasmid contains the 41 1 bp SC3 cDNA from the translational start site to 
the STOP codon in a pUC20 vector. The translational start site has been constructed to a 
Ncol site and a BamHI site has been added after the translational STOP codon. 

2 0 Example 26 

Construction of a vector for expression of a HFBI-dCBD fusion protein containing 
hydrophobic I and double cellulose binding domains (CBD) under the cbhl 
promoter of T. reesei 

25 For construction of an expression cassette for production of HFBI -dCBD fusion protein 
under cbhl promoter, the protein coding region of hfbl was amplified with PCR using 
pEAlO (Nakari-Setala et al. Eur. J. Biochem. (1996) 235:248-255) as a template. GGA 
ATT CCG CGG ACT GCG CAT CAT GAA GTT CTT CGC CAT CGC C (SEQ ID 
26) was used as a 5' primer in the PCR and TGA ATT C CA TAT G TT AGG TAC CAC 

30 CGG GGC CCA TGC CGG TAG AAG TAG AAG CCC CGG GAG CAC CGA CGG 
CGG TCT GGC AC (SEQ ID 27) as a 3' primer. The sequence in bold in the 5' primer 
is 16 bp of cbhl promoter adjacent to translational start site of the corresponding gene 
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and the underlined CCGCGG is a Kspl site. The underlined and bold sequences in the 3' 
primer are Ndel and Asp718 sites, respectively. The sequence in Italics in the 3' primer 
encodes for a Methionine-containing linker (PGASTSTGMGPGG). The resulting 
fragment of 370 bp was digested with Kspl and Ndel and ligated to pAMHUO 
5 (Nevalainen, K.M.H., Penttila, M.E., Harkki, A., Teeri, T.T. and Knowles, J. (1991) In 
Molecular Industrial Mycology. Eds. Leong, S.A. and Berka, R.. Marcel Dekker. New 
York) digested with same restriction enzymes. The resulting plasmid is pTNS29-2Asp. 

For further cloning steps, an Asp718 site was removed from the pTNS29-2Asp 
1 0 polylinker. The vector was digested with SacI and BamHI and the cleaved vector ends 
were blunted with T4 DNA polymerase and ligated together. The resulting vector 
pTNS29 is missing the SacI, Asp718 and Smal sites present in the pTNS29-2Asp 
polylinker. 

15 In the next step, a double cellose binding domain (dCBD) was amplified with PCR using 
pTNSll as a template. pTNSll carries the coding sequence for a fusion protein 
containing T. reesei CBHII CBD (41 N-terminal residues of CBHII) and CBHI CBD 
linked together via CBHI linker region (CBHI linker and CBD are the last 57 residues in 
CBHI). This DNA sequence originates from an E. coli expression vector B599 which is 

2 0 essentially the same as the one described by Linder et al. (J. Biol. Chem. (1996) 
271:21268-21272). TGA ATT CGGTACCCA GGC TTG CTC AAG CGT C (SEQ ID 
28) was used as a 5' primer in the PCR and TGA ATT CCATATGTC ACA GGC ACT 
GAG AGT AGT A (SEQ ID 29) as a 3' primer. The underlined sequences in the 5' and 3' 
primers are Asp718 and Ndel sites, respectively. The amplified fragment was digested 

25 with Asp718 and Ndel and ligated to pTNS29 digested with Asp718 and Ndel resulting 
in pTNS30. 

pTNS30 (Figure 20) expression vector thus carries the coding region for a fusion protein 
consisting of HFBI and double CBD linked in frame via the Methionine linker peptide 
30 (PGASTSTGMGPGG). Expression of the fusion protein is regulated by the cbhl 
transcriptional control sequences. The expression cassette may be released from the 
plasmid with EcoRI and Sphl. 
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Example 27 

Construction of vectors for expression of HFBI-single chain antibody fusion proteins 
under the cbhl promoter of T. reesei 

5 An expression vector was constructed for production of a fusion protein consisting of T. 
reesei HFBI protein in the N-terminus and in the C-terminus a single chain antibody 
recognizing a small molecular weight derivative of diarylalkyltriazole (ENA5ScFv). 
Production of the fusion protein is under the cbhl regulatory sequences. For construction of 
HFBI-ENA5ScFv fusion protein, pENA5ScFv was digested with Ncol and Xbal. The 

1 0 fragment containing the ena5scfv gene and the histidine tail (6 x His) was biut-end cloned to 
pTNS29 resulting in pTHl (Figure 21). pENA5ScFv vector carries the coding region for 
ENA5 single chain antibody consisting of the variable domains of the heavy and light chains 
connected via a glycine serine linker and a 6 x histidine tag at the C-terrninal end. 
Transcription and secretion of the single chain antibody are under control of the tac promoter 

15 and pelB signal sequence, respectively (Takkinen et at, 1991). pTNS29 vector carries the 
hfbl coding region of T. reesei followed by a linker sequence 
(ProGlyAlaSerThrSerThrGlyMetGlyProGlyGly) under the control of cbhl promoter and 
terminator sequences. 

20 For construction of HFBI-ENA5ScFv fusion protein with a thrombin cleavage site in the 
linker peptide, enaSsefv coding region (from Ala-23 to the STOP codon) and a peptide linker 
containing the thrombin cleavage site (Gly Thr Leu Val Pro Arg Gly Pro Ala Glu Val Asn 
Leu Val) preceeding it was amplified with PCR using pENA5ScFv as a template and as a 5' 
primer GAA TTC GGT ACC CTC GTC CCT CGC GGT CCC GCC GAA GTG AAC 

2 5 CTG GTG (SEQ ID 30) and as a 3' primer TGA ATT C CA TAT G CT AAC CCC GTT 
TCA TCT CCA G (SEQ ID 3 1). The sequence in bold in the 5' primer encodes the first 6 
N-terminal residues of ENA5SCFV. The sequence in italics is a thrombin cleavage site and 
underlined GGT ACC is an Asp71 8 site. The sequence in bold in the 3' primer encodes the 6 
C-terminal residues of ENA5ScFv and the underlined CA TATG is a Ndel site. The 790 bp 

30 PCR fragment was purified from agarose gel and ligated to pTNS29 resulting in pTH2 
(Figure 22). 
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Example 28 

Construction of a vector for production of class I hydrophobin SC3 of Schizophyllum 
commune in Trichoderma reesei for ATPS 

5 AT. reesei strain producing class I hydrophobin SC3 of S. commune was constructed. For 
that purpose an expression vector carrying the SC3 cDNA under the control of hfbl 
promoter and hfbl terminator of T. reesei was constructed. 

The hfbl terminator (SEQ ID 32) was amplified with PCR using as a template pEAlO 
1 0 (Nakari-Setala et al. 1996) and as a 5' primer GAC CTC GAT GCC CGC CCG GGG TCA 
AG (SEQ ID 33) and as a 3' primer GTC GAC ATT TCA TTT TAC CCC CCT CG 
(SEQ ED 34). The underlined sequences in the 5' and 3' primers are Sacl and Sail sites, 
respectively. The PCR fragment was cut with Sacl and Sail so that the Sacl site was 
blunted withKlenow. The fragment was cloned into cSC3/pUC20 (described in Example 
1 5 25) vector digested with Sail and BamHI where the BamHI was blunted with Klenow. In 
the next step, the hfb2 promoter (SEQ ID 35) was amplified with PCR using as a template 
pTNS8 (Nakari-Setala et al. 1997) and using as a 5' primer AAG CTT GCA TGC CTG 
CAT CC (SEQ ID 36) and as a 3' primer CCA TGG TGA AAG GTG GTG ATG GTT 
GG (SEQ ID 37). The underlined sequences in the 5' and 3' primers are Hindlll and 
20 Ncol sites, respectively. The PCR fragment was digested with Hindlll and Ncol and 
cloned in front of the SC3 cDNA in the plasmid obtained in the previous step and cut 
with the same restriction enzymes. The resulting plasmid is pKS2 (Figure 23). 

Example 29 

25 Construction of T. reesei strains producing EGIcore-HFBII, EGIcore-SC3, HFBI- 
dCBD and HFBI-single chain antibody fusion proteins, and SC3 hydrophobin 

Trichoderma reesei strains VTT-D-74075 (QM9414), VTT-D-86271 (Rut-C30) and VTT- 
D-99676 (Rut-C30 Ahjb2) were co-transformed essentially as described (Penttila et al., 
30 1987) using 3-13 ug of the plasmids pTNS32, pTHl, pTH2, pTH4, pTNS30 and pKS2 and 
as the selection plasmids 1-3 ug pToC202 or pAR021. pToC202 carrying the artidS gene 
(Hynese/a/., 1983; Tilbum^/a/., 1983) of A. nidulansand pARQ21 carrying the E. cohhph 
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gene have been described in Example 3, The Amd+ and Hyg+ transformants obtained were 
streaked three times onto plates containing acetamide and hygromycin, respectively (Penttila 
et ai, 1987). Thereafter spore suspensions were made from transformants grown on Potato 
Dextrose agar (Difco). 

5 

The production of the fusion proteins EGIcore-HFBH, EGIcore-SC3, HFBI-dCBD and 
HFBI-ENA5ScFv and the SC3 hydrophobin was tested by slot blotting or Western analysis 
with EGI, SC3, CBD and HFBI specific antibodies from shake flask or microtiter plate 
cultivations carried out in minimal medium supplemented with either lactose or Solka flock 
1 0 cellulose. The spore suspensions of the fusion protein producing clones were purified to 
single spore cultures on selection plates (containing either acetamide or hygromycin). To 
determine the best producers, production of the fusion proteins was analyzed again from 
these purified clones as described above. 

15 T reesei strains selected for further cultivations are X46A (pTNS32, host QM9414), VTT- 
D-00793 (pTH4, host Rut-C30 bhft>2\ VTT-D-99727 (pTNS30, host Rut-C30 Ahfb2\ 
VTT-D-00791 (pTHl, host Rut-C30) and VTT-D-G0792 (pKS2, host Rut-C30 Ahfb2). 
These strains were cultivated as described in Example 4 and the culture supematants were 
subjected to ATPS. 

20 

Example 30 

Recovery of the protein product from the enriched detergent phase after ATPS by 
isobutanol or other solvents 

25 After phase separation has occurred and the hydrophobin or the hydrophobin fusion 
protein is enriched in the detergent phase (the enriched phase), the protein can be 
recovered to an aqueous buffer by addition of isobutanol or other solvent. For example a 
series, of identical extractions of 0.05 g/1 HFBI in 50 raM acetate buffer was made with 
2% Berol 532. To separate tubes 10% final concentration of either isobutanol, n-amyl 

30 alcohol, octanol, or octane was added, and the fraction of hydrophobin recovered to the 
aqueous phase was analyzed. It was found that 100% was recovered with isobutanol, 
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89% with n-amyl alcohol, 81 % with octanol and 70 % with octane. Fractions were 
analyzed by HPLC as described in Example 38. 

5 Example 31 

Separation of EGIcore-HFBII fusion protein in ATPS. 

T.reesei strain X46A was grown in shake flasks on 3% lactose medium as described in 
Example 4. C12-18E05 detergent was added to a final concentration of 5% to the 
culture supernatant. After mixing the system was allowed to settle and the enriched 

10 detergent phase was subjected to further extraction with an equal volume of isobutanol. 
After buffer exchange on Biogel P-6 (Bio-Rad, USA) gel filtration media, the extracted 
protein was analysed by ion exchange chromatography using a Mono Q (Amersham 
Pharmacia, Sweden) column equilibriated with 1 5 mM acetate and eluted using a linear 
gradient of NaCl. Separation of the EGIcore-HFBII fusion protein to the detergent phase 

15 was verified by analyzing the eluted peak fraction from the Mono Q chromatography run 
by its activity on 4-methylumbelliferyl cellobioside as well as its re-extraction by ATPS. 

Example 32 

Separation of EGIcore-SC3 fusion protein in ATPS 

20 

Culture filtrate of T.reesei strain VTT-D-00793 producing EGlcore-SC3 fusion protein 
was subjected to APTS by adding 5% final concentration of C12-18E05. After the 
settling, the phases were separated and the fusion protein enriched detergent phase was 
further treated with an equal volume of isobutanol which removes the detergent to the 
25 isobutanol and leaves the protein in the aqueous phase. The resulting fusion protein 
enriched aqueous phase was then desalted on Biogel P-6 (Biorad, USA). Separation of 
fusion protein to the detergent phase was verified by analyzing the enzymatic activity of 
the EGIcore fusion partner on 4-methylumbelliferyl cellobioside in that phase. 



30 
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Example 33 

Separation of HFBI-dCBD fusion protein in ATPS 

T.reesei strain VTT-D-99727 producing HFBI-dCBD fusion protein was grown on 
5 lactose-containing medium as described in Example 4. Berol 532 was added to a final 
concentration of 2 % to 500 ml of culture supernatant. The mixture was allowed to settle 
and the phases separated in a separation funnel. The enriched top phase (10 ml) was 
extracted with an equal volume of isobutanol and 50 mM acetate buffer pH5. The fusion 
protein was selectively enriched to the buffer in the separation as shown by SDS gel 
10 electrophoresis (Figure 24). 

Example 34 

Recovery of the HFBI-dCBD fusion partners after chemical cleavage 

15 HFBI-dCBD protein produced by the strain VTT-D-99727 has a methionine 
(PGASTSTGMGPGG) designed in the linker region between the HFBI and the dCBD, 
which would enable the recovery of the native HFBI and dCBD after chemical cleavage with 
CNBr. 

2 0 The HFBI-dCBD fusion protein was purified as described in Example 33 . The resulting water 

phase (-108 ml) still contained a small amount of CBHI and free hydrophobin in addition to 
the HFBI-dCBD fusion (Figure 24). Further purification of the sample was performed by 
chromatography. The sample was desalted on a Biogel P-6 column equilibrated with 50mM 
sodium acetate buffer pH 5.5, diluted 1 + 3 with water and applied on a CM-Sepharose FF 
25 column equilibrated with lOmM sodium acetate buffer pH 5.5. The CBHI protein was found 
in the flow through and the purified HFBI-dCBD protein was eluted with 0.2M NaCl. 

A sample of purified HFBI-dCBD protein was evaporated in speed-vac almost to dryness. 
CNBr cleavage was performed by adding a solution of CNBr in 0.1M HC1 (5mg/ml) to the 

3 0 evaporated sample in 5 times weight excess. The reaction was allowed to continue for 24h at 

room temperature (in dark). A 10 times volume of water was added and the sample was 
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evaporated in speed-vac almost to dryness. The evaporated sample was diluted in 50mM 
Tris-HCl pH 7 and the cleavage was confirmed by HPLC analysis as described in Example 
37. 

5 The treatment was further optimised by using different concentrations of HC1 and incubation 
times. The final CNBr treated sample was subjected to ATPS as described in Example 33. 
HFBI was found in the top-phase and the dCBD in the bottom phase, thus leading to a 
separation of both fusion partners. 

1 0 Example 35 

Separation of HFBI-single chain antibody fusion protein in ATPS 

Strain VTT-D-00791 was cultivated for 7 days on medium containing 10 g/1 potassium 
phtalate, 15 g/1 KH 2 P0 4 , 5 g/L (NH^SC^ and 3% lactose and 0.2% peptone. 30 ml of 
15 culture medium was subjected to ATPS using 4% C12-18-E05 detergent as described in 
Example 5. After removal of the bottom phase, the detergent phase was extracted by 
isobutanol. Proteins from the water phase after detergent extraction and the bottom phase of 
the isobutanol extracted detergent phase were precipitated with trichloroacetic acid (final 
concentration 10%), resuspended in SDS-PAGE sample buffer (taking into acount the 

2 0 concentration factor of the isobutanol extracted detergent phase) and analysed by Western 

blotting using HFBI specific antibodies. The fusion protein runs as a dimer (approximately 70 
kDa ) in SDS-PAGE under the conditions used as seen in Figure 25. The water phase sample 
still contained a small amount of fusion protein. Nevertheless, separation of the fusion protein 
can be obtained by ATPS. 

25 

Example 36 

Separation of SC3 hydrophobin in ATPS 

SC3 hydrophobin producing strain VTT-D-00792 was cultivated in shake flasks on lactose- 

3 0 containing medium as described in Example 4. Soluble secreted proteins produced by the 

fungus were precipitated with trichloroacetic acid. Precipitated proteins were solubilized in 
trifluoroacetic acid to dissociate the SC3 aggregates whereafter the acid was evaporated with 
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a stream of air. The treated proteins were solubilized in 2 ml water containing 2.5 % of the 
detergent CI 2-1 8E05. Phase separation was carried out as described in Example 5. Samples 
were taken from the lighter and heavier phase and analysed by Western blotting using SC3 
specific antibodies. On the basis of this analysis, SC3 hydrophobin partitions to the detergent 
5 phase (Figure 26). 

Example 37 

Separation of nisin in ATPS 

10 1 mg of pure nisin (Sigma) (equals to 1000 IU) in 50 mM sodium acetate buffer pH 5 was 
subjected to ATPS using 2% of the detergent C12-18E05 at 30°C, and the phases were 
allowed to settle. After removal of the bottom phase, the detergent phase was extracted by 
isobutanol which removes the detergent to the isobutanol and leaves nisin in the aqueous 
phase. Separation of nisin to the enriched aqueous phase was assayd testing nisin 

15 bioactivity as described by Qiao et al. 1996. Comparison of the halos on the assay plate 
produced by the enriched phase and the nisin controls indicated that nisin had separated 
to the detergent phase with a concentration factor of approximately 5. 

2 0 Example 38 

Preparative purification of HFBI and HFBII in ATPS 

For preparative purposes, HFBI and HFBI were extracted from 0.5 1 of culture 
supernantants from cultivations carried out on media containing glucose or cellulose as 

25 described in Example 4. 2% (w/w) detergent was added to the culture supernatant which 
was mixed and then allowed to settle in a separation funnel at 20°C if C11E02 (Berol 
532) was used and 30°C if C12-18E05 (Henkel) was used. The detergent phase 
(enriched phase) was collected and mixed with an equal volume of isobutanol. When 
C11E02 was used an equal volume of 50 mM acetate buffer was also added. The 

30 remaining culture supernatant is the depleted phase. To follow the purification, analytical 
HPLC was run on samples from each step (Figure 27and 28). On the basis of HPLC 
analysis, both HFBI and HFBII partitioned well to both of the detergents. When 
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C11E02 was used, K values of >1000 and >78 were obtained for HFBl and HFBII, 
respectively. 

Water phase after isobutanol extraction was collected and fbrher purified on a Vydac 
5 1x20 cm seraipreparative C4 column (Vydac, USA) eqilibriated with 0.1% trifluoro 
acetic acid in water and eluting with a linear gradient of 0.1% trifluoro acetic acid in 
acetonitrile. The proteins were then lyophilized. In analytical runs a 4.6 mmx20 cm 
Vydac C4 column was used. 

1 0 Example 39 

Screening for separation conditions using purified HFBI and HFBII 

HFBI and HFBII which were purified from culture supernatants in ATPS, further 
purified by preparative HPLC and lyophilized as described in Example 38 were dissolved 

15 in water to a concentration of 0.5 mg/ml. The stock solutions were then diluted with the 
buffer appropriate for the experiment. 50 mM acetate buffer pH5, 50 mM glycine pH 3, 
50 mM HEPES pH 7 was used for pH screening and NaCl or (NH^SCk in acetate 
buffer was used for ionic strength experiments. The surfactant was weighed with the 
appropriate amount of hydrophobin solution and incubated in 0, 20, 30, 40°C and the 

20 amount of hydrophobin in the depleted phase determined. Partitioning coefficients were 
calculated from the change in concentration and the volume ratios and are presented in 
the tables below, 

25 
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Table presenting K-values for the separation of hydrophobins with different surfactants 
and temperature. 





Temperature 


Henkel 


Berol 532 


Berol 266 


Triton x 114 


HFBI 


0°C 


0,1 


368 


NP 


NP 




20°C 


0,4 


1961 


NP 


NP 




30°C 


85 


2970 


NP 


0,9 




40°C 


148 


682 


28 


69 


HFBII 


0°C 


NP 


76 


NP 


NP 




20°C 


1,0 


139 


NP 


NP 




30°C 


67 


194 


NP 


4,0 




40°C 


44 


102 


19 


44 



5 NP = no phase separation 



Table presenting the effect of different salt concentrations at 40°C on K-values 







Henkel 


Berol 532 


Berol 266 


Triton xl 14 


HFBI 


50 mM buffer 


244 


578 


13 


39 




+ 1 MNaCl 


239 


33 


198 


241 




+1 M (NH4) 2 S0 4 


0,9 


3,2 


176 


612 














HFBII 


50 mM buffer 


36 


80 


16 


58 




+ 1 M NaCl 


14 


87 


61 


38 




+1 M(NH4) 2 S0 4 


3,1 


351 


95 


35 



Table presenting the K-values for the partitioning of hydrophobins in Berol 532 at 
1 5 different conditions. 





HFBI 0°C 


HFBI 20°C 


HFBII 0°C 


HFBII 20°C 


50 mM acetate pH5 


298 


614 


76 


127 


+1 MNaCl 


614 


3781 


131 


199 


+1M (NH4)aS0 4 


931 


44 


203 


139 


pH3 


614 


1881 


73 


160 


pH7 


361 


614 


54 


126 



WO 00/58342 



48 



PCT/FIOO/00249 



Example 40 

Purification of a new putative T. reesei hydrophobin on the basis of its separation in 
ATPS, and cloning of the corresponding gene 

5 T. reesei strain VTT-D-99726 (QM9414 Ahjb2) was cultivated on lactose-based 
medium in 15 fermentor as described in Example 4. After cultivation, 1 1 of the culture 
filtrate was subjected to ATPS with 5% polyoxyethylene detergent CI 2-1 8E05. After 
phase separation at 30°C, detergent phase was separated from the depleted aqueous 
phase and analysed by SDS-PAGE. The analysed sample contained a protein which 

10 was about 7.5 kDa. In addition, some larger proteins were present. These were 
removed by hydrophobic interaction chromatography on a Phenyl-Sepharose FF 
column equilibriated with 50 mM sodium acetate - 1 M (NH^SC^ buffer. The 
proteins were eluted with descending (NH^SC^ salt gradient. Fractions containing 
the ca. 7.5 kDa protein were pooled and concentrated, and analysed by mass 

1 5 spectrometry. This revealed that the purified protein was degraded to three peptides 
(2486, 2586 and 2709 Da), which were held together by disulphide linkages. The 
terminal amino acid sequence from the 2486 Da peptide was 
ANAFCPEGLLYTNPLCCDLL, which on the basis of position of cysteines and on 
sequence comparison to known hydrophobins is typical to a hydrophobin. 

20 

Degenerated primers were designed on the basis of this amino acid sequence as well 
as on the basis of the sequence obtained from the 2586 Da peptide. These primers 
were used in RT-PCR (RobusT RT-PCR Kit, Finnzymes) with RNA isolated from the 
same cultivation from where the protein was purified. The 140 bp fragment obtained 
25 from the RT-PCR reaction was sequenced. The obtained sequence contained a part 
that was coding for the 2486 Da peptide used to design the PCR primer confirming 
that the PCR product corresponds to the purified protein. 
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Example 41 

Purification of EGIcore-HFBI in ATPS in pilot scale 

The strain VTT-D-99702 producing EGIcore-HFBI fosion under the cbkl promoter was 
5 cultivated in pilot scale fermentor in 1200 L on minimal medium containing 4% lactose, 
0.4 % peptone and 0.1% Yeast Extract for 4 days. Cultivation temperatures were 28°C 
or alternatively the temparature was changed during the cultivation from 27°C to 22°C in 
a step-wise manner. Production level of the fusion protein was a few grams per liter. 
After the end of the cultivation, the mycelium was separated by means of a rotating 
1 0 vacuum drum filter using Celite 53 5 diatomaceous earth as filter aid. 

In separation 1, 1100 L of the supernatant was transferred into the meanwhile cleaned 
bioreactor, adjusted to the separation temperature of 24,7 °C and mixed with salt and 
detergent to give 0,15 M ammonium dihydrogen phosphate and 4,1 % of detergent CI 2- 

15 18E05. The phases were left to separate by gravity settling and the heavier phase 
removed through the bottom valve. A parallel experiment in 10 ml volume was carried 
out to investigate the effect of scale up on the separation of the fusion protein. The 
partition coefficient and the concentration factor were equal for the 10 ml and 1200 L 
separation within the measurement error. The separated detergent phase was re- 

2 0 extracted using the detergent phase from the first extraction by replacing half of the 
volume of the bottom phase by tap water. The salt concentration was adjusted to 0,25 M 
ammonium dihydrogen phosphate at 30 D C. K and Y values of the separations are 
presented in the below Table. 

25 Alternatively in separation 2, 10 ml of drum filtrated culture medium samples were 
subjected to ATPS at 25°C using 4.1% C12-18E05 detergent and 1.15 M (NH^HsPCV 
K and Y values after one separation step are presented in the Table below. 

30 
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Table presenting extraction experiments of culture supernatants from different pilot scale 
cultivations 



Separation 


Cultivation 
temperature 


Concentration factor 


K 


Y [%] 


Separation I 


27°C 22°C 


3.4 


3.8 


59 


Re-extraction 






8 


70 


Separation 2 


29°C 


2.8 


8.3 


81 



Example 42 

Separation of EGI, EGIcore-HFBI fusion protein and HFBI hydrophobin in robust 
micelle/polymer systems 

Separations were performed with purified samples. HFBI was purified by two-phase 
separation and HPLC as decribed in Example 38. EGlcore-HFBl was purified by two-phase 
extraction as decribed in Example 5 followed by desalting on a Biogel P-6 column 
equiliberated with 20mM sodium acetate puffer pH 6 containing 150mM NaCl. The purified 
proteins were subjected to ATPS using different micelle/poiymer systems. 

The table below presents partitioning coefficients of pure EGI, HFBI and EGIcore-HFBI 
at constant tie-line length, in the different systems, Triton X-114/water system (4.1 % 
wt) at 31.5° C, Triton X-100/Reppal/water system (8.1 % wt. / 8.2 % wt.) and Triton 
X-114/Reppal/water system (5.0 % wt. / 4.0 % wt.) at 21° C. The standard deviation is 
shown within the brackets. The systems were buffered with 25 mM sodium acetate, pH 
4.0. K > 1 is equivalent to a preferred protein partitioning to the micelle rich phase. 





EGI 


HFBI 


EGIcore-HFBI 




Kegi 


Khfb 


K.EGIHFB 


Yield (%) 
top phase 


Triton X-l 14 


0.6 (0.1) 


21.5 (1.5) 


8.4 (1.0) 


85 


Triton X-100/Reppal 


0.9 (0.1) 




1.7 (0.1) 


56 


Triton X-l 14/Reppal 


0.7 (0.1) 


11.0 (1.1) 


15.4 (2.9) 


91 



15 
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Claims 

1 . A process for partitioning of proteins or cells in aqueous two-phase systems (ATPS), 
comprising the steps of 

a) in order to obtain a fusion protein or cell, combining a protein or a cell of interest 
to a targeting protein selected from the group consisting of amphipathic and hydrophobic 
proteins having the ability to partition in ATPS and to carry said protein or cell of 
interest into one of the phases, and 

b) subjecting said fusion protein or cell carrying the targeting protein to an ATPS 
separation. 

2. The process according to claim 1, wherein the targeting protein is a hydrophobin or 
hydrophobin-like protein or a part thereof. 

3. The process according to claim 2, wherein the hydrophobin is a Trichoderma 
hydrophobin or a part thereof. 

4. The process according to claim 3, wherein the Trichoderma hydrophobin is HFBI, 
HFBII or SRHI ,or a part thereof. 

5. The process according to claim 1, wherein the amphipathic or hydrophobic proteins 
or parts thereof form aggregates. 

6. The process according to any one of claims 1 to 5 for partitioning cells in ATPS, 
wherein in step a) of claim 1 the combination of the cells of interest to the targeting 
protein comprise bringing said targeting protein onto the surface of said cells. 

7. The process according to claim 6, wherein the cells are yeast cells. 

8. The process according to claim 6 wherein the cells are spores. 

9. The process according to any one of claims 6 to 8, wherein the targeting protein is 
fused to a protein which brings the targeting protein onto the surface of the cell. 
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1 0. A fusion protein, comprising a hydrophobin or hydrophobin-like protein as defined in 
any one of claims 2 to 5 fused to a protein of interest. 

1 1. The fusion protein according to claim 10, wherein the protein of interest is a cell- 
bound protein or a part thereof 

12. The fusion protein according to claim 10, wherein the protein of interest is an 
extracellular protein or a part thereof. 

13. The fusion protein according to claim 12, wherein the extracellular protein is an 
extracellular protein of Trichoderma, selected from the group consisting of cellulases, 
hemicellulases and proteases. 

14. The fusion protein according to claim 10, wherein the protein of interest is an 
antibody protein or a part thereof. 

15. The process according to claim 1, wherein the targeting protein is fused to the 
protein of interest according to any one of the claims 10 to 14. 

16. A recombinant organism producing a fusion protein according to any one of claims 
10 tol4. 

17. The recombinant organism according to claim 16, wherein the organism has been 
genetically modified to be capable of producing a fusion protein according to any one of 
claims 10 to 14. 

18. A recombinant DNA molecule, comprising a DNA molecule encoding a fusion 
protein according to any one of claims 10 to 14. 

19. A process for producing a fusion protein according to any one of the claims 10 to 14 
with recombinant organisms, comprising the steps of 

a) transforming the organism with DNA molecules enabling expression of such proteins, 
and 
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b) recovering such protein from the culture of the recombinant organism. 



20. The process according to any one of claims 1 to 9 and 15, wherein the aqueous two- 
phase system is selected from the group consisting of PEG/salt, PEG/Dextran and 

5 PEG/starch systems or derivatives thereof, detergent-based aqueous two-phase systems 
and thermoseparating polymer systems. 

21 . The process according to claim 20, wherein the detergent-based ATPS comprises a 
detergent which is selected from the group consisting of nonionic and zwitterionic 

10 detergents. 

22. The process according to claim 20, wherein the thermoseparating polymer system 
comprises a polymer which is selected from the group consisting of polyethylene- 
polypropylene copolymers. 

15 

23. The process according to any one of claims 1-9 and 15, wherein the protein or cell of 
interest is separated from a suspension containing cells or cell extracts. 

24. A process for separating hydrophobins or hydrophobin-Hke proteins or parts thereof 
20 in aqueous two-phase systems, comprising the steps of 

a) mixing solutions containing said hydrophobic hydrophobin-like protein or parts 
thereof with the phase forming chemicals, and 

b) carrying out ATPS separation, 

wherein the aqueous two-phase system is as defined in any one of claims 20 to 22. 

25 
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SEQUENCE LISTING 

<110> Valtion teknillinen tutkimuskeskus 

<120> Process for partitioning of molecules 

<130> 31805 

<140> 
<141> 

<160> 42 

<170> Patentln Ver. 2.2 

<210> 1 

<211> 428 

<212> DNA 

<213> Trlchodezma. reesei 
<220> 

<221> intron 

<222> (167) . . (235) 

<220> 

<221> intron 
<222> (323) . . (386) 

<220> 

<223> Coding sequence of hfbl 
<400> l 

atgaagtcct tcgccatcgc cgctctcttt gccgccgctg ccgttgccca gcctctcgag 60 

gaccgcagca acggcaacgg caatgtttgc cctcccggcc tcttcagcaa cccccagtgc 120 

tgtgccaccc aagtccttgg cctcatcggc cttgactgca aagtccgtaa gttgagccat 18 0 

aacataagaa tcctcttgac ggaaatatgc cttctcactc cCttaccccC gaacagcctc 240 

ccagaacgtt tacgacggca ccgacttccg caacgtctgc gccaaaaccg gcgcccagcc 300 

tctctgctgc gtggcccccg ttgtaagttg atgccccagc tcaagctcca gtctttggca 360 

aacccattct gacacccaga ctgcaggccg gccaggctct tctgtgccag accgccgtcg 420 

gtgcttga 428 



<210> 2 
<211> 78 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5 1 primer 



<400> 2 

tcgggcacta cgtgccagta tagcaacgac tactactcgc aatgccttgt tccgcgtggc 60 
tctagttctg gaaccgca 78 
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<210> 3 
<211> 30 
<212> DNA 

<2l3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 3 

tcgtacggat cctcaagcac cgacggcggt 3 0 

<210> 4 

<211> 63 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 

<400> 4 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcagag caacggcaac 60 
ggc 63 

<210> 5 

<211> 2211 

<212> DNA 

<213> Trichoderma. reesei 
<220> 

<221> promoter 

<222> (1) . . (2211} 

<223> cbhl promoter sequence 

<400> 5 

gaattctcac ggtgaatgta 
acctccatta cgcctccccc 
tagattgggg agaagttgac 
ggtcggcaac ggcaaaaaag 
tccaggaacc tggatacatc 
tattcgccct aaaccgaagt 
gtgtgtcttc tctaggtgca 
tccgagctgt aactacctct 
tgcatcatgt atataatagt 
tggtcatcaa acaaagaacg 
agaactggat acttgttgtg 
tctactcaaa caccaagctt 
agttgtgaag tcggtaatcc 
ctgctgcgaa cccggagaat 
catgaaaggc tatgagaaat 
acaagcaaag cgttccgtcg 
taagtagcga tggaaccgga 
caatgcaggg gtactgagct 
ggcgtttccc tgattcagcg 
ggacgtgttt tgcccttcat 
tgaccgactg gggctgttcg 
aggcatgttg tgaatctgtg 



ggccttctgt agggtaggaa 
atagagttcc caatcagtga 
ttccgcccag agctgaaggt 
cacgtggctc accgaaaagc 
catcatcacg cacgaccact 
gcgtggtaaa tctacacgtg 
ttctttcctt cctctagtgt 
gaacctctgg agaatggtgg 
gatcctgaga aggggggttt 
aagacgcctc ttttgcaaag 
tcttctgtgt atttttgtgg 
gctcttttga gctacaagaa 
cgctgtatag taatacgagt 
cgagatgtgc tggaaagctt 
tctggagacg gcttgttgaa 
cagtagcagg cactcattcc 
ataatataat aggcaataca 
tggacataac tgttccgtac 
tacccgtaca agtcgtaatc 
ttggagaaat aatgtcattg 
aagcccgaat gtaggattgt 
Cc 999cagga cacgcctcga 



ttgtcactca agcaccccca 60 
gtcatggcac tgttctcaaa 120 
cgcacaaccg catgatatag 180 
aagatgtttg cgatctaaca 240 
ttgatctgct ggtaaactcg 300 
ggcccctttc ggtatactgc 360 
tgaattgttt gtgttgggag 420 
actaacgact accgtgcacc 480 
ggagcaatgt gggactttga 540 
ttttgtttcg gctacggtga 600 
caacaagagg ccagagacaa 660 
cctgtggggt atatatctag 720 
cgcatctaaa tactccgaag 780 
ctagcgagcg gctaaattag 840 
tcatggcgtt ccattcttcg 900 
cgaaaaaact cggagattcc 960 
ttgagttgcc tcgacggttg 1020 
cccacctctt ctcaaccttt 1080 
actattaacc cagactgacc 1140 
cgatgtgtaa tttgcctgct 1200 
tatccgaact ctgctcgtag 1260 
aggttcacgg caagggaaac 1320 
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caccgatagc agtgtctagt agcaacctgt aaagccgcaa tgcagcatca ctggaaaata 1380 
caaaccaacg gctaaaagta cataagctaa tgcctaaaga agtcatatac cagcggctaa 1440 
taattgtaca atcaagtggc taaacgtacc gtaatttgcc aacgcgttgt ggggttgcag 1500 
aagcaacggc aaagcccact tcccacgttt gtttcttcac tcagtccaat ctcagctggt 1560 
gatcccccaa ttgggtcgct tgtttgttcc ggtgaagtga aagaagacag aggtaagaat 1620 
gtctgactcg gagcgttttg catacaacca agggcagtga tggaagacag tgaaacgttg 1680 
acattcaagg agtatttagc cagggatgct tgagtgtatc gtgtaaggag gtttgtctgc 1740 
cgatacgacg aatactgtat agtcacttct gatgaagtgg tccatactga aatgtaagtzc 1800 
ggcactgaac aggcaaaaga ttgagttgaa actgcctaag atctcgggcc ctcgggcttc I860 
ggctttgggt gtacatgttt gtgctccggg caaatgcaaa gtgtggtagg atcgaca^ac 1920 
tgctgccttt accaagcagc tgagggtatg cgataggcaa atgttcaggg gccactgcat 19B0 
ggtttcgaat agaaagagaa gctCagccaa gaacaacagc cgataaagat agcctcatta 2040 
aacgaaatga gctagcaggc aaagccagcg aatgtgtata tataaaggcc cgaggtccgc 2100 
gcctccctca tgctctcccc atctactcat caactcagat cctccaggag acttgtacac 2160 
catcttttga ggcacagaaa cccaatagtc aaccgcggac tgcgcatcat g 2211 



<210> 6 

<211> 1588 

<212> DNA 

<213> Trlchoderma reesei 



<220> 

<223> T. reesei egll cDNA 



<400> 6 

cccccctatc 

accacggcca 

cccgaggtcc 

caggacacct 

tcgtgcaccg 

aagaactgct 

agcctcacca 

cggctgtatc 

ctgagcttcg 

tctcagatgg 

agcggctact 

agccaccagg 

gccttgaccc 

ccctatggca 

accttcacca 

agcatcaccc 

gacaccatcn 

gccctgagca 

aaccggctcg 

atcctggcca 

gggtctacta 

actacacgga 

cagtgcggtg 

catagcaacg 

gacgggggca 

aagacatgct 

agcaaaaaaa 



ttagtcctcc 
tcctggccat 
atcccaagtt 
cggtggtcct 
tcaacggcgg 
tcatcgaggg 
tgaaccagta 
tcctggactc 
acgtcgacct 
acgagaacgg 
gcgatgctca 
gcttctgctg 
ctcactctcg 
gcggctacaa 
tcatcaccca 
gcaagtacca 
cgccctgccc 
gcggcatggt 
acagcggcaa 
acaaccccaa 
cgaactcgac 
ggagcccgac 
gcattgggta 
actactactc 
cgatagaatg 
atgtcgtatc 
aaaaaaaaaa 



ttgttgtccc 
tgcccggctc 
gacaacctac 
tgactggaac 
cgtcaacacc 
cgtcgactac 
catgcccagc 
tgacggtgag 
ctctgctctg 

ggg c gccaac 
gtgccccgtc 
caacgagatg 
cacggccacg 
aagctactac 
gtccaacacg 
gcaaaacggc 
gtccgcctca 
gctcgtgttc 
cgccggcccc 
cacgcacgtc 
tgcgcccccg 
gacttcgagc 
cagcgggtgc 
gcaatgcctt 
cgggcacgca 
tacattagca 
aaaaaaaa 



aaaatggcgc 
gtcgccgccc 
aagtgtacaa 
taccgctgga 
acgctctgcc 
gccgcctcgg 
agctctggcg 
tacgtgatgc 
ccgtgtggag 
cagtataaca 
cagacatgga 
gatatcctgg 
gcctgcgact 
ggccccggag 
gacaacggct 
gtcgacatcc 
gcctacggcg 
agcatttgga 
tgcagcagca 
gtcttctcca 
cccccgcctg 
agcccgagct 
aagacgtgca 
tagagcgttg 

gggagctcgt 

aatgacaaac 



cctcagttac 
agcaaccggg 
agtccggggg 
tgcacgacgc 
ctgacgaggc 
gcgtcacgac 
gctacagcag 
tgaagctcaa 
agaacggctc 
cggccggtgc 
ggaacggcac 
agggcaactc 
ctgccggttg 
ataccgttga 
cgccctcggg 
ccagcgccca 
gcctcgccac 
acgacaacag 
ccgagggcaa 
acatccgccg 
cgtccagcac 
gcacgcagac 
cgtcgggcac 
acttgcctcc 
agacattggg 
aaatgaaaaa 



actgccgttg 
taccagc?. cc 
gtgcgtggcc 
aaactacaac 
gacctgtggc 
ctcgggcagc 
cgtctctcct 
cggccaggag 
gctctacctg 
caactacggg 
cctcaacact 
gagggcgaat 
cggcttcaac 
cacctccaag 
caaccttgtg 
gcccggcggc 
catgggcaag 
ccagtacacg 
cccacccaac 
gggagac .-.c 
gacgctttcg 
tcaccggggg 
tacgtgccag 
ggtccgtcca 
cttaatatat 
gaacctatca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1588 
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<210> 7 

<211> 745 

<212> DNA 

<213? Trichoderma reesei 



<22Q> 

<221> terminator 
<222> (1) . . (745) 
<223> T. reesei cbhl terminator 



<400> 7 

ggacctaccc agtctcacta cggccagtgc ggcggtattg gctacagcgg ccccacggtc 60 

tgcgccagcg gcacaacttg ccaggtcctg aacccttact actctcagtg cctgtaaagc 120 

tccgtgcgaa agcctgacgc accggtagat tcttggtgag cccgtatcat gacggcggcg 180 

ggagctacat ggccccgggt gatttatttt ttttgtatct acttctgacc cttttcaaat 240 

atacggtcaa ctcatctttc actggagatg cggcctgctt ggtattgcga tgttgtcagc 300 

ttggcaaatt gtggctttcg aaaacacaaa acgattcctt agtagccatg cattttaaga 360 

taacggaata gaagaaagag gaaattaaaa aaaaaaaaaa aacaaacatc ccgttcataa 420 

cccgtagaat cgccgctctt cgtgtatccc agtaccacgt caaaggtatt catgatcgtt 480 

caatgttgat attgttccgc cagtatggct ccacccccat ctccgcgaat ctcctcttct 540 

cgaacgcggt agtggctgct gccaattggt aatgaccata gggagacaaa cagcataata 600 

gcaacagtgg aaattagtgg cgcaataatt gagaacacag tgagaccata gctggcggcc 660 

tggaaagcac tgttggagac caacttgtcc gttgcgaggc caacttgcat tgctgtcaag 72 0 

acgatgacaa cgtagccgag gaccc 745 



<210> 8 
<2ll> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

•=223> Description of Artificial Sequence: annealed primer 
<400> 8 

taaccgcggt 10 



<210> 9 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 
<400> 9 

ctagaccgcg gttaat 16 



<210> 10 

<211> 1232 

<212> DNA 

< 2 1 3 > Tri choderma. rees ex 
<220> 

<221> promoter 

<222> (1) . . (1232) 
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<223> T. reesex gpdl promotor 



<400> 10 

gtcgacacga tatacaggcg 
atgtgacaat attgactgcg 
caccaaggtg catgcatcat 
gttagtgaag gcaccgtgat 
cagtacgtca agtaacgtca 
gattgcctca atcacatagt 
agattgagag ctaccttaca 
ccccagccgg cgcgattgtg 
tgcttggcaa tcgtgctcag 
gcagttgcta aggtacctag 
taggatggcg gctggcgggg 
aagttgatga taccgttttg 
gatgccgcat caccgttgtg 
ccactatctc aggtaaccag 
gtgttaatct ttccacctcc 
cttcctcctc ttcttctcca 
agcctcgcaa atcctcagtt 
gcccgacgat gcggcttctg 
gccccgccag gactctcctg 
tccctctcgt ttcttcctaa 
cacgcagctc aattcgcaga 



cggctgatga taatgatgat 
aggaaccatc aggtgtgtat 
aaggattctc ctcagctcac 
ggcaagatag aaccactatt 
aagccgccct cccgtaacct 
acctacctat gcattatggg 
tcaatatggc cagcacctct 
tgtactaggt aggctcgtac 
ctgttaggtt gtacttgtat 
ggagggattc aacgagccct 
ccgaagctgg gaactcgcca 
ccagattaga tgcgagaagc 
tcagaaacga ccaagctaag 
gtactaccag ctaccctacc 
ctcctcaatc ttcttttccc 
taaccattcc taacaacatc 
tgtatgtacg t acgt act ac 
ttcgcctgcc cctcctctca 
cgtcaccaat ttttttccct 
caaacaacca ccaccaaaat 
tacaaatcta ga 



cgagcatgac ttgatgctgc 60 
ggatggaatc attctgtaac 120 
caacaacgaa cgatggccat 180 
gcatctgcgc ttcccacgca 240 
cgcccgttgt tgctccc 300 
cggcctcaac ccaccccccc 360 
tcggcgatac atacccgcca 420 
tataccagca ggagaggcgc 480 
ggtacttgta aggtggtcat 540 
gcttccaatg tccatctgga 600 
acagtcatat gtaatagctc 660 
agcacgaatg tcgctcatcc 720 
caactaaggt accttaccgt 780 
tgccgtgcct acctgcttta 840 
tcctctcctc tttttttttt 900 
gacattctct cctaatcacc 960 
aatcatcacc acgatcgtcc 1020 
ctcgtgccct tgacgagcta 1080 
atttacccct cctccctctc 1140 
ctctttggaa gctcacgacc 1200 

1232 



<210> 11 

<211> 1129 

<212> DNA 

<213> Tri choderma reesel 



<220> 

<221> terminator 

<222> (1) . . (1129) 

<223> T. reesei gpdl terminator 



<400> 11 

ggatcccgag cattgtctat gaatgcaaac aaaaatagta aataaatagt aattctggcc 60 

atgacgaata gagccaatct gctccacttg actatcttgt gactgtatcg tatgtcgaac 120 

ccttgactgc ccattcaaac aattgtaaag gaatatagct acaagttatg tctcacgttt 180 

gcgtgcgagc ccgtttgtac gttattttga gaaagcgttg ccatcacatg ctcacagtca 240 

cttggcttac gatcatgttt gcgatcttcg gtaagaatac acagagtaac gattatctcc 300 

atcgcttcta tgattaggta ctcagacaac acatgggaaa caagataacc atcgcatgca 360 

aggtcgattc caatcatgat ctggactggg gtattccatc taagccatag taccctc^ag 420 

agaaggaatg gtaggacctc tcaggcgtcc accatctgtg ctgcaaatcc aagaaacccc 480 

ccaaaagcac ctacctatct acctagagta actgcacgag aaaagaaaag gagcagaaga 540 

agaatgatct caagaggccg tgaacgcaga aacacactcc tcccaacttt tcaagttttg 600 

aacaaaaaaa gaaagatgag gactagaaga tggagtattt ccttcttaga gagctctcgg 660 

tgaggtgacc tgtcagggtt taccgcaaac cgtcggtggt tctatccaat taatcaagtc 720 

ccgcgcctcg cctcttctct cctgtccttt catagaatcc cgtctccttg ttgcttgatc 780 

gaagcggggt tatcgacgcc accaaagatc ttgtcttggt gacttatcaa tcctttggtg 840 

atcaaacagc ccccgagtga tcagatccgt aaaagaagaa gaagagtacg atttaaccag 900 

accgaggaac aataaagcga gtaaataaca tcaaaataag agtctcgttg aaaattactt 960 

gttcctcaat caatcccaac ccccctaaaa gcccttcccc ccatggtata tcccggcagt 1020 

aggagagaga tatttccact accgctcacc accaagtgag gcttgccgag agaagaggat 10B0 

gaatcagaag tgacaacaac gggttgagca catgggatat cggcgcgcc 1129 
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<210> 12 
<211> 5733 
<212> DWA 

<213> Aspergillus nidulans 



<220> 

<223> (1-5733) Sequence of plasmid pAN52-l 



<220> 

<221> promoter 

<222> (1) . . (2129) 

<223> A. nidulans gpdA promoter 

<220> 

<22l> gene 

<222> (2130) . . (2304) 

<223> A. nidulans gpdA gene 

<220> 

<22l> terminator 

<222> (2305) . . (3071) 

<223> A. nidulans trpC terminator 

<220> 

<221> misc_feature 

<222> (3072) . . (5726) 

<223> pUClB from Sail to EcoRI 



<400> 12 

caattccctt gtatctctac acacaggctc aaatcaataa gaagaacggt tcgtcttttt 60 

cgtttatatc ttgcatcgtc ccaaagctat tggcgggata ttctgtttgc agttggctga 120 

cttgaagtaa tctctgcaga tctttcgaca ctgaaatacg tcgagcctgc tccgcttgga 180 

agcggcgagg agcctcgtcc tgtcacaact accaacatgg agtacgataa gggccagttc 240 

cgccagctca ttaagagcca gttcatgggc gttggcatga tggccgtcat gcatctgtac 300 

ttcaagtaca ccaacgctct tctgatccag tcgatcatcc gctgaaggcg ctttcgaatc 360 

tggttaagat ccacgtcttc gggaagccag cgactggtga cctccagcgt ccctttaagg 420 

ctgccaacag ctttctcagc cagggccagc ccaagaccga caaggcctcc ctccagaacg 480 

ccgagaagaa ctggaggggt ggtgtcaagg aggagtaagc tccttattga agtcggagga 540 

cggagcggtg tcaagaggat attcttcgac tctgtattat agataagatg atgaggaatt 6 00 

ggaggtagca tagcttcatt tggatttgct ttccaggctg agactctagc ttggagcata 660 

gagggtcctt tggctttcaa tattctcaag tatctcgagt ttgaacttat tccctgtgaa 720 

ccttttattc accaatgagc attggaatga acatgaatct gaggactgca atcgccatga 780 

ggttttcgaa atacatccgg atgtcgaagg cttggggcac ctgcgttggt tgaatttaga 840 

acgtggcact attgatcatc cgatagctct gcaaagggcg ttgcacaatg caagtcaaac 900 

gttgctagca gttccaggtg gaatgttatg atgagcattg tattaaatca ggagatatag 960 

catgatctct agttagctca ccacaaaagt cagacggcgt aaccaaaagt cacacaacac 1020 

aagctgtaag gatttcggca cggctacgga agacggagaa gccaccttca gtggactcga 1080 

gtaccattta attctatttg tgtttgatcg agacctaata cagcccctac aacgaccatc 1140 

aaagtcgtat agctaccagt gaggaagtgg actcaaatcg acttcagcaa catctcctgg 1200 

ataaacttta agcctaaact atacagaata agataggtgg agagcttata ccgagctccc 1260 

aaatctgtcc agatcatggt tgaccggtgc ctggatcttc ctatagaatc atccttattc 1320 

gttgacctag ctgattctgg agtgacccag agggtcatga cttgagccta aaatccgccg 1380 

cctccaccat ttgtagaaaa atgtgacgaa ctcgtgagct ctgtacagtg accggtgact 1440 

ctttctggca tgcggagaga cggacggacg cagagagaag ggctgagtaa taagccactg 1500 

gccagacagc tctggcggct ctgaggtgca gtggatgatt attaatccgg gaccggccgc 1560 

ccctccgccc cgaagtggaa aggctggtgt gcccctcgtt gaccaagaat ctattgcatc 1620 

atcggagaat atggagcttc atcgaatcac cggcagtaag cgaaggagaa tgtgaagcca 1680 

ggggtgtata gccgtcggcg aaatagcatg ccattaacct aggtacagaa gtccaattgc 1740 
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ttccgatctg gtaaaagatt cacgagatag taccttctcc gaagtaggta gagcgagtac 1800 

ccggcgcgta agctccctaa ttggcccatc cggcatctgt agggcgtcca aatatcgtgc 1860 

ctctcctgct ttgcccggtg tatgaaaccg gaaaggccgc tcaggagctg gccagcggcg 1920 

cagaccggga acacaagctg gcagtcgacc catccggcgc tctgcactcg acctgctgag 1980 

gtccctcagt ccctggtagg cagctttgcc ccgtctgtcc gcccggtgt.g tcggcggggt 2040 

tgacaaggtc gttgcgtcag tccaacattt gttgccatat tttcctgctc tccccaccag 2100 

ctgctctttt ctttcctctt tcttttccca tcctcagtat aCtcatcttc ccatccaaga 2160 

acctttattt cccctaagta agtactttgc tacatccata ctccatcctt cccatccctt 2220 

attcctttga acctttcagt tcgagctttc ccacttcatc gcagcttgac taacagctac 2280 

cccgcttgag cagacatcac catggatcca cttaacgtta ctgaaatcat caaacagctc 2340 

gacgaatctg gatataagat cgttggtgtc gatgtcagct ccggagttga gacaaatggt 2400 

gttcaggatc tcgataagat acgttcattt gtccaagcag caaagagtgc cttctagtga 2460 

tttaatagct ccatgtcaac aagaataaaa cgcgttttcg ggtttacctc ttccagatac 2520 

agctcatctg caatgcatta atgcattgac tgcaacctag taacgccttn caggctccgg 2580 

cgaagagaag aatagcttag cagagctatt ttcattttcg ggagacgaga tcaagcagat 2640 

caacggtcgt caagagacct acgagactga ggaatccgct cttggctcca cgcgactata 27O0 

tatttgtctc taattgtact ttgacatgct cctcttcttt actctgatag cttgactatg 2760 

aaaattccgt caccagcncc tgggttcgca aagataattg catgtttctt ccttgaactc 2820 

tcaagcctac aggacacaca ttcatcgtag gtataaacct cgaaatcant tcctactaag 2880 

atggtataca atagtaacca tgcatggttg cccagtgaat gctccgtaac acccaatacg 2940 

ccggccgaaa cttttttaca actctcctat gagtcgttta cccagaatgc acaggtacac 3000 

ttgtttagag gcaatccttc tttctagaag tcctcgcgta ctgtgtaagc gcccactcca 3060 

catctccact cgacctgcag gcatgcaagc ttggcaccgg ccgccgtttt acaacgtcgt 3120 

gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc 3180 

agctggcgta atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg 3240 

aatggcgaat ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac 3300 

cgcatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga 3360 

cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgctCac 3420 

agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 3 480 

aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata 3540 

ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 3600 

tgtttatttc tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 3660 

atgcttcaat aatattgaaa aaggaagagt atgagtattc aacattCccg tgtcgccctt 3720 

attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa 3780 

gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac 3840 

agcggtaaga tcct-tgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt 3900 

aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt 3960 

cgccgcacac actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat 4020 

cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac 4080 

actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg 4140 

cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc 4200 

ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa 4260 

ctattaaccg gcgaactact tactctagct tcccggcaac aattaataga ctggatggag 4 320 

gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct 4380 

gataaatctg gagccggtga gcgtgggtct cgcggtatca ccgcagcact ggggccagat 4440 

ggtaagccct cccgtatcgt: agttatctac acgacgggga gtcaggcaac tatggatgaa 4 500 

cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac 4560 

caagtttact catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc 4620 

taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc 4680 

cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg 4740 

cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtccgccg 4800 

gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca 4860 

aatactgtcc ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg 4920 

cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg 4 980 

cgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga 5040 

acggggggct cgtgcacaca gcccagcttg gagcgaacga cccacaccga actgagatac 5100 

ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat 5160 

ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc 5220 
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tggtatcttt atagtcctgt cgggtxtcgc cacctctgac ttgagcgtcg atttttgtga 5280 

tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc 534C 

ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg 5400 

gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag 5460 

cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc 5520 

gcgcgttggc cgattcatta atgcagccgg cacgacaggt ttcccgactg gaaagcgggc 5580 

agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac 5640 

tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga 5700 

aacagctatg accatgatta cgaatcgcgg ccg 5733 



<210> 13 
<211> 39 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 13 

gtcaaccgcg gactgcgcat catgaagttc ttcgccatc 3 9 



<210> 14 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; PCR 5' primer 

<400> 14 

tctagcaagc ttggctctag ttctggaacc gcaccaggcg gcagcaacgg caacggcaat 60 
gtttgc 66 



<210> 15 

<2ii> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 15 

tcgtacaagc tttcaagcac cgacggcggt 3 0 



<210> 16 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 



<400> 16 

tctagctcta gaagcaacgg caacggcaat gtt 



33 
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<210> 17 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 

<4Q0> 17 

tgctagtcga cctgctagca gcaccgacgg cggtctg 37 



<210> 18 
<211> 4614 
<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<223> S. cerevisiae FLOl coding sequence 
<400> 18 

atgacaatgc ctcatcgcta tatgtttttg gcagtcttta cacttctggc actaactagt 60 

gtggcctcag gagccacaga ggcgtgctta ccagcaggcc agaggaaaag tgggatgaat 120 

ataaattttt accagtattc attgaaagat tcctccacat attcgaatgc agcatatatg 180 

gcttatggat atgcctcaaa aaccaaacta ggttctgtcg gaggacaaac tgatatctcg 240 

attgattata atattccctg tgttagttca tcaggcacat ttccttgtcc tcaagaagat 300 

tcctatggaa actggggatg caaaggaatg ggtgcttgtt ctaatagtca aggaattgca 360 

tactggagta ctgatttatt tggtttctat actaccccaa caaacgtaac cctagaaatg 420- 

acaggttatt ttttaccacc acagacgggt tcttacacat tcaagtttgc tacagttgac 480 

gactctgcaa ttctatcagt aggtggtgca accgcgttca actgttgtgc tcaacagcaa 540 

ccgccgatca catcaacgaa ctttaccatt gacggtatca agccatgggg tggaagtttg 600 

ccacctaata tcgaaggaac cgtctatatg tacgctggct actattatcc aatgaaggtt 660 

gtttactcga acgctgtttc ttggggtaca cttccaatta gtgtgacact tccagatggt 720 

accactgtaa gtgatgactt cgaagggtac gtctattcct ttgacgatga cctaagtcaa 780 

tctaactgta ctgtccctga cccttcaaat tatgctgtca gtaccactac aactacaacg 840 

gaaccatgga ccggtacttt cacttctaca tctactgaaa tgaccaccgt caccggtacc 900 

aacggcgttc caactgacga aaccgtcatt gtcatcagaa ctccaacaac tgctagcacc 960 

atcataacta caaccgagcc atggaacagc acttttacct ctacttctac cgaattgacc 1020 

acagtcactg gcaccaatgg tgtacgaact gacgaaacca tcattgtaat cagaacacca 108 0 

acaacagcca ctactgccat aactacaact gagccatgga acagcacttt tacctctact 1140 

tctaccgaat tgaccacagt caccggtacc aatggtttgc caactgatga gaccatcatt 1200 

gtcatcagaa caccaacaac agccactact gccatgacta caactcagcc atggaacgac 1260 

acttttacct ctacatccac tgaaatgacc accgtcaccg gtaccaacgg tttgccaact 1320 

gatgaaacca tcattgtcat cagaacacca acaacagcca ctactgctat gactacaact 1380 

cagccatgga acgacacttt tacctctaca tccactgaaa tgaccaccgt caccggtacc 1440 

aacggtttgc caactgatga aaccatcatt gtcatcagaa caccaacaac agccactact 1500 

gccatgacta caactcagcc atggaacgac acttttacct ctacatccac tgaaatgacc 1560 

accgtcaccg gtaccaatgg tttgccaact gatgagacca tcattgtcat cagaacacca 1620 

acaacagcca ctactgccat gactacaact cagccatgga acgacacttt tacctctaca 1680 

tccactgaaa tgaccaccgt caccggtacc aacggtttgc caactgatga aaccatcatt 1740 

gtcatcagaa caccaacaac agccactact gccataacta caactgagcc atggaacagc 1800 

acttttacct ctacttctac cgaattgacc acagtcaccg gtaccaatgg tttgccaact I860 

gatgagacca tcattgtcat cagaacacca acaacagcca ctactgccat gactacaact 1920 

cagccatgga acgacacttt tacctctaca tccactgaaa tgaccaccgt caccggtacc 1980 

aacggtttgc caactgatga aaccatcatt gtcatcagaa caccaacaac agccactact 2040 

gccatgacta caactcagcc atggaacgac acttttacct ctacatccac tgaaatgacc 2100 

accgtcaccg gtaccaacgg tttgccaact gatgagacca tcattgtcat cagaacacca 2160 



WO 00/58342 



10 



PO7FI00/00249 



acaacagcca ctactgccat gactacaact cagccatgga acgacacttt tacctctaca 2220 

tccactgaaa tgaccaccgt caccggtacc aacggcgttc caactgacga aaccgtcatt 228 0 

gtcatcagaa ctccaactag tgaaggtcta atcagcacca ccactgaacc atggactggt 2340 

actttcacct ctacatccac tgagatgacc accgccaccg gcactaacgg tcaaccaact 24 00 

gacgaaaccg tgattgttat cagaactcca accagtgaag gtttggttac aaccaccact 2460 

gaaccatgga ctggtacttt tacttctaca tccactgaaa tgaccaccat tactggaacc 2520 

aacggcgttc caactgacga aaccgtcatt gtcatcagaa ctccaaccag tgaaggtcta 2580 

atcagcacca ccactgaacc atggactggt acttttactt ctacatctac tgaaatgacc 264 0 

accattactg gaaccaatgg tcaaccaact gacgaaaccg ttattgttat cagaactcca 2700 

actagtgaag gtctaatcag caccaccact gaaccatgga ctggtacttt cacttctaca 2760 

tctactgaaa tgaccaccgt caccggtacc aacggcgttc caactgacga aaccgtcatt 2820 

gtcatcagaa ctccaaccag tgaaggtcta atcagcacca ccactgaacc atggactggc 2880 

actttcactt cgacttccac tgaggttacc accatcactg gaaccaacgg tcaaccaact 2940 

gacgaaactg tgattgttat cagaactcca accagtgaag gtctaatcag caccaccact 3000 

gaaccatgga ctggtacttt cacttctaca tctgctgaaa tgaccaccgt caccggtact 3060 

aacggtcaac caactgacga aaccgtgatt gttatcagaa ctccaaccag tgaaggtttg 3120 

gttacaacca ccactgaacc atggactggt acttttactt cgacttccac tgaaatgtct 3180 

actgtcactg gaaccaatgg cttgccaact gatgaaactg tcattgttgt caaaactcca 3240 

actactgcca tctcatccag tttgtcatca tcatcttcag gacaaatcac cagctctatc 3300 

acgtcttcgc gtccaattat taccccattc tatcctagca atggaacttc tgtgatttct 3360 

tcctcagtaa tttcttcctc agtcacttct tctctattca cttcttctcc agtcatttct 3420 

tcctcagtca tttcttcttc tacaacaacc tccacttcta tattttctga atcatctaaa 3480 

tcatccgtca ttccaaccag tagttccacc tctggttctt ctgagagcga aacgagttca 3540 

gctggttctg cctcttcttc ctcttttatc tcttctgaat catcaaaatc tcctacatat 3600 

tcttcttcat cattaccact tgttaccagt gcgacaacaa gccaggaaac tgcttcttca 3660 

ttaccacctg ctaccactac aaaaacgagc gaacaaacca ctttggttac cgtgacatcc 3720 

tgcgagtctc atgtgtgcac tgaatccatc tcccctgcga ttgtttccac agctactgtt 3780 

actgttagcg gcgtcacaac agagtatacc acatggtgcc ctatttctac tacagagaca 3840 

acaaagcaaa ccaaagggac aacagagcaa accacagaaa caacaaaaca aaccacggta 3900 

gttacaattt cttcttgtga atctgacgta tgctctaaga ctgcttctcc agccattgta 3960 

tctacaagca ctgctactat taacggcgtt actacagaat acacaacatg gtgtcctatt 4020 

tccaccacag aatcgaggca acaaacaacg ctagttactg ttacttcctg cgaatctggt 4080 

gtgtgttccg aaactgcttc acctgccatt gtttcgacgg ccacggctac tgtgaatgat 4140 

gttgttacgg tctatcctac atggaggcca cagactgcga atgaagagtc tgtcagctct 4200 

aaaatgaaca gtgctaccgg tgagacaaca accaatactt tagctgctga aacgactacc 4260 

aatactgtag ctgctgagac gattaccaat actggagctg ctgagacgaa aacagtagtc 4320 

acctcttcgc tttcaagatc taatcacgct gaaacacaga cggcttccgc gaccgatgtg 4380 

attggtcaca gcagtagtgt tgtttctgta tccgaaactg gcaacaccaa gagtctaaca 4440 

agttccgggt tgagtactat gtcgcaacag cctcgtagca caccagcaag cagcatggta 4500 

ggatatagta cagcttcttt agaaatttca acgtatgctg gcagtgccaa cagcttactg 4560 

gccggtagtg gtttaagtgt cttcattgcg tccttattgc tggcaattat ttaa 4614 



<210> 19 
<211> 5857 
<212> DNA 

<213> Saccharomyces cerevisiae and E . coli 
<220> 

<221> promoter 
<222> (1} . . (452) 

<223> S. cerevisiae GAL1 promoter 
<220> 

<223> (476-495) E. coli T7 promoter/priming site 
<220> 

<223> (502-601) E. coli multiple cloning site 
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<220> 

<223> (609-857) S. cerevisiae CYC1 transcription terminator 
<220> 

<223> ( 1039-1712) E. coli pMBl (pUC-derivedt origin 
<220> 

<221> gene 

<222> (1857) . . (2717) 

<223> E. coli ampicillin resistance gene 
<220> 

<221> gene 

<222> (2735) . . (3842) 

<223> S. cerevisiae URA3 gene 

<220> 

<223> (3846-5317) S, cerevisiae 2 micron origin 
<220> 

<223> (5385-5840) £. coli fl origin 
<220> 

<223> (1-5857) Sequence of pYES2 
<400> 19 

acggattaga agccgccgag cgggtgacag ccctccgaag gaagactctc ctccgtgcgt 6 0 
cctcgtcctc accggtcgcg ttcctgaaac gcagatgtgc ctcgcgccgc actgctccga 120 
acaataaaga ttctacaata ctagctttta tggttatgaa gaggaaaaat tggcagtaac 180 
ctggccccac aaaccttcaa atgaacgaat caaattaaca accataggat gataatgcga 240 
ttagtttttt agccttattt ctggggtaat taatcagcga agcgatgatt tttgatctat 300 
taacagatat ataaatgcaa aaactgcatt aaccacttta actaatactt tcaacatttt 360 
cggtttgtat tacttcttat tcaaatgtaa taaaagtatc aacaaaaaat tgttaatata 420 
cctctatact ttaacgtcaa ggagaaaaaa ccccggatcg gactactagc agctgtaata 480 
cgactcacta tagggaatat taagcttggt accgagctcg gatccactag taacggccgc 540 
cagtgtgctg gaattctgca gatatccatc acactggcgg ccgctcgagc atgcatctag 600 
agggccgcat catgtaatta gttatgtcac gcttacattc acgccctccc cccacatccg 660 
ctctaaccga aaaggaagga gttagacaac ctgaagtcta ggtccctatt tattttttta 720 
tagttatgtt agtattaaga acgtcattta tatttcaaat ttttcttttt tttctgtaca 780 
gacgcgtgta cgcatgtaac attatactga aaaccttgct tgagaaggtt ttgggacgct 84 0 
cgaaggcttt aatttgcggc cctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 900 
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 960 
ctgcggcgag cggtatcagc tcactcaaag gcggCaatac ggttatccac agaatcaggg 1020 
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa agcccaggaa ccgtaaaaag 1080 
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 1140 
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 1200 
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 1260 
tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 1320 
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 1380 
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 1440 
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 1500 
ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 1560 
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 1620 
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 1680 
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 1740 
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 1800 
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 1860 
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 1920 
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gcctgactcc ccgtcgtgta gataactacg atacgggagc gcttaccatc tggccccagt 1980 

gctgcaatga taccgcgaga cccacgccca ccggctccag atttatcagc aataaaccag 2040 

ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc cattcagtct 2100 

attaattgtc gccgggaagc tagagtaagc agttcgccag tcaacagttc gcgcaacgtt 2160 

gccggcattg ctacaggcat cgtggtgtca ctctcgtzcgt ttggtatggc ctcattcagc 2220 

tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 2280 

agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtc atcactcatg 2340 

gtnacggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 2400 

actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 2460 

tgcccggcgt caatacggga taatagtgta tcacatagca gaactttaaa agtgctcatc 2520 

attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 258 0 

tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 264 0 

tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 2700 

aaatgttgaa tactcatact cttccttttt caatgggtaa taactgatat aattaaattg 2760 

aagctctaat ttgtgagttt agtatacatg catttactta taatacagtt ttttagtttt 2S20 

gctggccgca tcttctcaaa tatgcttccc agcctgcttt tctgtaacgt tcaccctcta 2880 

ccttagcatc ccttcccttt gcaaatagtc ctcttccaac aataataatg tcagatcctg 2940 

tagagaccac atcatccacg gttctatact gttgacccaa tgcgtctccc ttgtcatcta 3000 

aacccacacc gggtgtcata atcaaccaat cgtaaccttc atctcttcca cccatgtctc 3060 

tttgagcaat aaagccgata acaaaatctt tgtcgctctt cgcaatgtca acagtaccct 3120 

tagtatattc tccagtagat agggagccct tgcatgacaa ttctgctaac atcaaaaggc 318 0 

ctctaggttc ctttgttact tcttctgccg cctgcttcaa accgctaaca atacctgygc 324 0 

ccaccacacc gtgtgcattc gtaatgtctg cccatcctgc tattctgtat acacccgcag 3300 

agtactgcaa tttgactgta Ctaccaatgt cagcaaattt tctgtcttcg aagagtaaaa 3360 

aattgtactt ggcggataat gcctttagcg gcttaactgt gccctccatg gaaaaatcag 342 0 

tcaagatatc cacatgtgtt tttagtaaac aaattttggg acctaatgct tcaactaact 3480 

ccagtaattc cttggtggta cgaacatcca atgaagcaca caagtttgtt tgcttttcgt 354 0 

gcatgatatt aaatagcttg gcagcaacag gactaggatg agtagcagca cgttccttat 3600 

atgtagcttt cgacatgatt tatcttcgtt tcctgcaggt ttttgttctg tgcagttggg 366 0 

ttaagaatac tgggcaattt catgtttctt caacactaca tatgcgtata tataccaatc 3720 

taagtctgtg ctccttcctt cgttcttcct tctgttcgga gattaccgaa tcaaaaaaat 3780 

ttcaaagaaa ccgaaatcaa aaaaaagaat aaaaaaaaaa tgatgaattg aattgaaaag 384 0 

ctagcttatc gatgataagc tgtcaaagat gagaattaat tccacggact atagactata 3900 

ctagatactc cgtctaccgt acgatacact tccgctcagg tccttgtcct ttaacgaggc 3960 

cttaccactc ttttgttact ctattgatcc agctcagcaa aggcagtgtg atctaagatt 4020 

ctatcctcgc gatgtagtaa aactagctag accgagaaag agactagaaa tgcaaaaggc 4 080 

acttctacaa tggctgccat cattattatc cgatgtgacg ctgcagcttc tcaatgatat 4140 

tcgaatacgc tttgaggaga tacagcctaa tatccgacaa actgctttac agatttacga 4200 

tcgtacttgt tacccatcat tgaattttga acatccgaac ctgggagttt tccctgaaac 4260 

agatagtata tttgaacctg tataataata tatagtctag cgctctacgg aagacaatgt 4320 

atgtatttcg gttcctggag aaactattgc atctattgca taggtaatcc tgcacgCcgc 4380 

atccccggtt cattttctgc gtttccatct tgcacttcaa tagcatatct ttgttaacga 444 0 

agcatctgtg cctcattttg tagaacaaaa atgcaacgcg agagcgctaa tttttcaaac 4500 

aaagaatctg agctgcattt ttacagaaca gaaatgcaac gcgaaagcgc tattttacca 4560 

acgaagaatc tgtgcttcat ttttgtaaaa caaaaacgca acgcgacgag agcgctaatt 462 0 

ttccaaacaa agaatctgag ctgcattttt acagaacaga aatgcaacgc gagagcgcta 4680 

ttttaccaac aaagaatcta tacttctttt Ctgctctaca aaaatgcatc ccgagagcgc 4740 

tatttttcta acaaagcatc ttagattact ttttttctcc tttgtgcgct ccataatgca 4800 

gtctcctgat aactttttgc actgtaggtc cgttaaggtt agaagaaggc tactttggtg 4860 

tctattttct cttccataaa aaaagcctga ctccacttcc cgcgtttact gattactagc 4920 

gaagctgcgg gtgcattttt tcaagataaa ggcatccccg attatattct ataccgatgt 4980 

ggactgcgca tactttgtga acagaaagtg atagcgccga tgattcctca Ctggtcagaa 5040 

aattacgaac ggtttcttct attccgtctc tatatactac gtataggaaa tgtttacatt 5100 

ttcgtattgt tttcgactca ctctatgaat agttctcact acaatttttt tgtctaaaga 5160 

gtaatactag agataaacat aaaaaatgta gaggtcgagt ttagatgcaa gttcaaggag 522 0 

cgaaaggtgg atgggtaggt tatataggga tatagcacag agatatatag caaagagata 528 0 

cttttgagca atgtttgtgg aagcggtatt cgcaatggga agctccaccc cggttgataa 534 0 

tcagaaaagc cccaaaaaca ggaagattgt ataagcaaat atttaaattg taaacgttaa 5400 
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tattttgtta aaattcgcgt taaatttttg ttaaatcagc tcatttttta acgaatagcc 5460 

cgaaatcggc aaaatccctt ataaatcaaa agaatagacc gagatagggt tgagtgttgt 5520 

tccagtttcc aacaagagtc cactattaaa gaacgtggac tccaacgtca aagggcgaaa 5580 

aagggtctat cagggcgatg gcccactacg tgaaccatca ccctaatcaa gttttttggg 5640 

gtcgaggtgc cgtaaagcag taaatcggaa gggtaaacgg atgcccccat ttagagcttg 5700 

acggggaaag ccggcgaacg tggcgagaaa ggaagggaag aaagcgaaag gagcgggggc 5760 

tagggcggtg ggaagtgtag gggtcacgct gggcgtaacc accacacccg ccgcgcttaa 5820 

tggggcgcta cagggcgcgt ggggatgatc cactagt 5857 



<210> 20 
<211> 403 
<212> DNA 

<213> Trichoderma reesei 



<220> 

<223> (1-403) T. reesei hft>2 coding sequence 



<220> 

<221> intron 
<222> (131) . . (200) 



<220> 

<221> intron 
<222> (287) . . (358) 



<400> 20 

atgcagttct tcgccgtcgc 

ggcctcttct ccaaccctct 

tgcaagaccc gtatgttgaa 

acttacacga tgctttacag 

ctgtgccagc aagggctcca 

caatggcaaa gaagtaaaaa 

cgaccaggct ctcctgtgcc 



cctcttcgcc accagcgccc 

gtgctgtgcc accaacgtcc 

ttccaatctc tgggcatcct 

ctaccatcgc cgtcgacact 

agcctctttg ctgcgttgct 

gacatttggg cctgggatcg 

agaaggccat cggcaccttc 



tggctgctgc ctgccctacc 60 
tcgacctcat tggcgttgac 120 
gacattggac gatacagttg 180 
ggcgccatct tccaggctca 240 
cccgtggtaa gtagtgctcg 300 
ctaactcttg atatcaaggc 360 
taa 403 



<210> 21 
<211> 59 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 21 

cggaggagct cgacgacttc gagcagcccg agctgcacgc aggctgtctg ccctaccgg 59 



<210> 22 
<211> 29 
<212> DNA 

<2l3> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 3' primer 



<400> 22 

tcattggatc cttagaaggt gccgatggc 



29 
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<210> 23 

<2ll> 679 

<212> DNA 

<213> Schizophyllum commune 
<220> 

<223> (1-679) SC3 coding sequence 



<220> 

<223> (1-92) 1st cDNA 



<220> 

<223> (146-183) 2nd cDNA 
<220> 

<223> (240-317) 3rd CDNA 



<220> 

<223> (374-469) 4th cDNA 



<220> 

<223> (524-586) 5th cDNA 
<220> 

<223> (635-679) 6th cDNA 



<400> 23 

atgttcgccc gtctccccgt cgtgttcctc tacgccttcg tcgcgttcgg cgccctcgtc 60 

gctgccctcc caggtggcca cccgggcacg acgtacgtcg acctctcacc gtcctctaat 120 

gtcttgctga tgaagccccg tatagcacgc cgccggttac gacgacggtg acggtgacca 180 

cggtgagtag ctttctcgcc gtcgacgact cgaacgcatt ggctaatttt tgctcatagc 240 

cgccctcgac gacgaccatc gccgccggtg gcacgtgtac tacggggtcg ctctcttgct 300 

gcaaccaggt tcaatcggta cgtacatcaa agcggcacga ccaggcatct cagctgacgg 360 

ccacatcgta caggcgagca gcagccctgt taccgccctc ctcggcctgc tcggcattgt 420 

cctcagcgac ctcaacgttc tcgttggcat cagctgctct cccctcactg tgagatcttt 480 

ttgttcactg tcccaattac tgcgcactga cagactttgc caggtcatcg gtgtcggagg 540 

cagcggctgt tcggcgcaga ccgtctgctg cgaaaacacc caattcgtat gtatactttc 600 

catgcgtgtc cctttctccg ctaatcatct gtagaacggg ctgatcaaca tcggttgcac 660 
ccccatcaac atcctctga 679 



<210> 24 
<211> 63 
<212> DNA 

<213:> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 



<400> 24 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcaggg tggccacccg 60 
ggc 63 
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<210> 25 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 25 

tcgnacggat cctcagagga tgttgatggg 30 



<210> 26 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 



<210> 27 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 27 

tgaattccat atgttaggta ccaccggggc ccatgccggt agaagtagaa gccccgggag 60 
caccgacggc ggtctggcac 80 



<210> 28 
<211> 31 
<212> DNA 

<2l3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 



<400> 26 

ggaattccgc ggactgcgca tcatgaagtt cttcgccatc gcc 



43 



<400> 28 

tgaattcggt acccaggctt gctcaagcgt c 



31 



<210> 29 
<211> 34 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
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<400> 29 

tgaattccat atgtcacagg cactgagagt agta 34 



<210> 30 

<211> 48 

<212> DNA 

<2l3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 

<400> 30 

gaattcggta ccctcgtccc tcgcggtccc gccgaagtga acctggtg 48 



<210> 31 

<211> 34 

<212> DNA 

<2l3>> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3" primer 

<400> 31 

tgaattccat atgctaaccc cgtttcatct ccag 34 



<210> 32 
<21l> 918 
<212> DNA 

<213> Trichodexma reesei 
<220> 

<22l> terminator 
<222> (1) . . (918) 

<223> T. reesei hfbl terminator 
<400> 32 

gatgcccgcc cggggtcaag gtgtgcccgt gagaaagccc acaaagtgtt gatgaggacc 60 
atttccggta ctgggaaagt tggctccacg tgtttgggca ggtttgggca agttgtgtag 120 
atattccatt cgtacgccat tcttattctc caatatttca gtacactttt cttcataaat 180 
caaaaagact gctattctct ttgtgacatg ccggaaggga acaattgctc ttggtctctg 24 0 
ttatttgcaa gtaggagtgg gagattcgcc ttagagaaag tagagaagct gtgcttgacc 300 
gtggtgtgac tcgacgagga tggactgaga gtgttaggat taggtcgaac gttgaagtgt 360 
atacaggatc gtctggcaac ccacggatcc tatgacttga tgcaatggtg aagatgaatg 420 
acagtgtaag aggaaaagga aatgtccgcc ttcagctgat atccacgcca atgatacagc 480 
gatatacctc caatatctgt gggaacgaga catgacatat ttgtgggaac aacttcaaac 540 
agcgagccaa gacctcaata tgcacatcca aagccaaaca ttggcaagac gagagacagt 600 
cacattgtcg tcgaaagatg gcatcgtacc caaatcatca gctctcatta tcgcctaaac 660 
cacagattgt ttgccgtccc ccaactccaa aacgttacta caaaagacat gggcgaatgc 720 
aaagacctga aagcaaaccc tttttgcgac tcaattccct cctttgtcct cggaatgatg 780 
atccttcacc aagtaaaaga aaaagaagat tgagataata catgaaaagc acaacggaaa 840 
cgaaagaacc aggaaaagaa taaatctatc acgcaccttg tccccacact aaaagcaaca 900 
9999g9gtaa aatgaaat 918 
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<210> 33 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<40D> 33 

gacctcgatg cccgcccggg gtcaag 26 



<210> 34 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 34 

gtcgacattt cattttaccc ccctcg 26 



<210> 35 

<211> 1190 

<212> DNA 

<2l3> Trichoderma reesei 



<220> 

<221> promoter 

<222> (1) . . (1190) 

<223> T. reesei hfb2 promoter 



<400> 35 

ctcgagcagc tgaagcttgc atgcctgcat cctttgtgag cgactgcatc cattttgcac 60 
acactgccgt cgacgtctct cttccgacct tggccagctg gacaagcaac acaccaatga 120 
cgctttgtat tattagagta tatgcaagtc tcaggactat cgactcaact ctacccaccg 180 
aggacgatcg cggcacgata cgccctcgtt ctcattggcc caagcagacc aactgcccct 24 0 
ggagcaagat tcagcccaag ggagatggac ggcagggcac gccaggcccc caccaccaag 300 
ccactccctt tggccaaatc agcttgcatg tcaagagaca tcgagctgtg ccttgaaatt 360 
actaacaacc agggatggga aacgaagcct gcttttggaa agacaacaat gagagagaga 420 
gagagaggga gagagacaat gagtgccaca aacctggtag tgctccgcca atgcgtctga 480 
aatgtcacat ccgagtcttg gggcctctgt gagaatgtcc agagtaatac gtgttttgcg 54 0 
aatagtcctc tttcttgagg actggatacc tacgataccc tttttgagtt gatgcggtgc 6 00 
tttcgaagta ttatctggag gatagaagac gtctaggtaa ctacacaaaa ggcctatact 660 
ttggggagta gcccaacgaa aggtaactcc tacggcctct tagagccgtc atagatccta 720 
cagcctcttg gagccgtcat agatcacatc tgtgtagacc gacattctat gaataatcat 780 
ctcatcatgg ccacatacta ctacatacgt gtctctgcct acctgacatg tagcagtggc 840 
caagacacca aggccccagc atcaagcctc cctacctatc ccttccattg tacagcggca 900 
gagagattgc gatgagccct ctccctacct acagacggct gacaatgtcc gtataccacc 96 0 
agccaacgtg atgaaaacaa ggacatgagg aacagcctgc gagagctgga agatgaagag 1020 
ggccagaaaa aaaagtataa agaagacctc gattcccgcc atccaacaat cttttccatc 1080 
ctcatcagca cactcatcta caaccatcac cacattcact caactcctct ttctcaactc 1140 
tccaaacaca aacattcttt gtcgaatacc aaccatcacc acctttcaag 1190 
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<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; PCR 5' primer 
<400> 36 

aagcttgcat gcctgcatcc 20 



<210> 37 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 

<400> 37 

ccatggtgaa aggtggtgat ggttgg 26 



<210> 38 
<211> 13 
<212> PRT 

<213> Trichoderma reesei 
<220> 

<223> vild type T. reesei EG1 peptide linker 
<400> 38 

Val Pro Arg Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly 
15 10 



<210> 39 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: modified CBHII linker 
<400> 39 

Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly 
15 10 



<210> 40 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Met/Thrombin 
linker 

<400> 40 

Pro Gly Arg Pro Val Leu Thr Gly Pro Gly Met Gly Thr Ser Thr Ser 
15 10 15 

Ala Gly Pro 



<210> 41 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Met -containing linker 
<400> 41 

Pro Gly Ala Ser Thr Ser Thr Gly Met Gly Pro Gly Gly 
15 10 



<210> 42 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker containing the thrombin 
cleavage site 



<400> 42 

Gly Thr Leu Val Pro Arg Gly Pro Ala Gly Val Asn Leu Val 
15 10 
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